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Preface 



This volume contains invited and contributed papers presented at the eighth 
edition of the International Summer School on Neural Networks, dedicated to 
Prof. Eduardo R. Caianiello. The school was established in 1996 by Prof. Antonio 
Zichichi, director of the Ettore Maiorana Centre for Scientific Culture in Erice 
(Sicily), and Prof. Maria Marinaro, director of the International Institute for 
Advanced Scientific Studies in Vietri sul Mare, in Italy. The school is held each 
year, alternating between Erice and Vietri, proposing surveys on several research 
fields related to cybernetics studies and human-machine interaction. The eighth 
edition of the school was on cortical dynamics and was held in Erice. 

The contributions collected in this book are aimed at providing primarily 
high-level tutorial coverage of the fields related to cortical dynamics, report- 
ing recent experimental and theoretical results investigating the processing, the 
transmission, and the imprinting of information in the brain, and important 
functions of the cortical area such as cortical rhythms, cortical neural plasticity, 
and their structural basis and functional significance. 

Height surveys, reporting the most recent original results, are offered by 
specialists in the held. Consequently, the volume may be used as a reference 
book on models of cortical dynamics from neuroscience and physics. To this aim 
the volume is divided into two sections: fundamentals of cortical dynamics, and 
mathematical models of cortical dynamics. 

Fundamentals of cortical dynamics deals with problems related to the func- 
tioning of the brain in complex biological systems, their organization and the 
existence of hierarchies and multiple-scale networks at several levels of miniatur- 
ization and information processing. Fundamental and innovative ideas in neuro- 
science are covered. This section contains three tutorial papers. The first tutorial, 
authored by Bruce P. Graham, discusses underlying dynamic signal processing 
within hippocampal pyramidal cells, their properties and their functioning as as- 
sociative memories. The second, by Luigi Agnati and colleagues, describes and 
accounts for various basic features and functions of the central nervous system 
(CNS), suggesting that the understanding of its functioning is better explained 
by hypothesizing nested hierarchical structures of computational devices that 
process incoming input signals at different computational levels of miniaturiza- 
tion and then integrate both spatial and temporal relationships among informa- 
tional elements and computations, exhibiting emergent behaviors. Finally, the 
tutorial by Alessandro Treves reports on the physiological changes that may have 
occurred during the evolutionary process in the neuronal circuitry and may have 
contributed to the development of a fundamental neuronal mechanism such as 
the bring rate adaptation. 

Mathematical models of cortical dynamics deals with the mathematical mod- 
eling and computer simulation of the brain’s functions. Inspired by the recent 
discoveries on the chaotic behavior of neuronal cells and supported by the need 
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to find solutions for approximating the probability distributions of the cortical 
cell’s dynamics, this research aims to describe brain dynamics using theoretical 
neural networks models. To this aim it appears that mean field theory (MFT) 
and chaotic theory offer realistic interpretations. The first paper in this section, 
authored by John Hertz and colleagues, shows how mean field theory is able 
to model appropriately the dynamics of dense, randomly connected cortical cir- 
cuits, their firing statistics and the correlations among them. The second paper, 
by Fortunato Tito Arecchi, describes the process of “feature binding”, i.e., the 
capacity of the brain to combine external data with internal memories into co- 
herent patterns of meaning, and its modeling through lromoclinic chaos (HC). 
The following three papers demonstrate how several characteristics observed in 
the dynamic activity of the brain can be mathematically modeled in terms of 
the information theory of chaos. The first paper of this series, authored by Ichiro 
Tsuda and Hiroshi Fujii, provides a mathematical model that accounts for the 
instability and the nonstationary features of the cortical activity through the the- 
ory of chaotic itinerancy (Cl). The second, authored by Ichiro Tsuda and Shigeru 
Kuroda, proposes a computational model for the formation of episodic memory 
based on cantor coding theory. Finally, the last paper of the series, authored 
by Hiroshi Fujii and Ichiro Tsuda, shows how the behavior of some interneuron 
systems in the neocortex generates a variety of synchronous inhibitory rhythms 
that could be mathematically described by an expression of chaotic itinerancy 
between pseudoattractors. 

The editors would like to thank the University of Salerno, the Regione Cam- 
pania, the Ettore Majorana Center for Scientific Studies, the Italian Neural 
Network Society, the Italian Ministry for Education and Scientific Reserarch 
(MIUR), and the International Institute for Advanced Scientific Studies “E.R. 
Caianiello”, for their support in sponsoring, financing, and organizing the school. 
In addition, the editors are grateful to the contributors of this volume whose work 
stimulated an extremely interesting interaction with the attendees, who in turn 
shall not be forgotten for being so highly motivated and bright. This book is 
dedicated to the memory of Prof. Eduardo Renato Caianiello, who disappeared 
10 years ago from the scientific scene. His work and his dedication to research in 
cybernetics is still alive in all of us and continue to stimulate our research work. 



Peter Erdi, Anna Esposito, Maria Marinaro, Silvia Scarpetta 
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Abstract. A major challenge to understanding cortical function is the 
complexity found both at the single cell and microcircuit levels. This 
review covers theoretical studies aimed at elucidating dynamic signal 
processing within hippocampal pyramidal cells. This processing involves 
both the intrinsic pyramidal cell properties as well as the microcircuit 
of inhibitory interneurons that synapse onto the cell. These factors are 
considered within the framework of associative memory function in areas 
CA1 and CA3 of the mammalian hippocampus. 



1 Introduction 

Considerable detail is now known about the individual neuronal cell types found 
in the cortex, and the circuits they form. However, mathematical models and 
computer simulations typically concentrate on a particular level of detail, either 
the single cell, or networks with simplified cellular descriptions. It is both pos- 
sible and desirable to try to formulate functional models of cortical networks 
that include known details of all cell types and their detailed microcircuitry. 
This review considers theoretical modelling studies that cover aspects of the 
function of the mammalian hippocampus. Firstly, details of the microcircuitry 
formed by pyramidal cells and the variety of inhibitory interneurons in regions 
CA3 and CA1 of the hippocampus are given. Then single cell studies of hip- 
pocampal pyramidal cells are introduced. Finally, certain network-level models 
that treat these hippocampal areas as associative memories are described. Only 
models that attempt to include biophysical details of the cell types and realistic 
microcircuitry are included here. The emphasis is on providing an overview of a 
breadth of work that is not usually considered together, at the expense of depth 
in any particular aspect. 

Associative memory is one of the oldest artificial neural network (ANN) 
paradigms. More than any other ANN, it is also plausibly a model of how cer- 
tain brain regions may operate. Of particular interest here is the mammalian 
hippocampus, in which regions CA3 and CA1 have been proposed to be auto- 
and heteroassociative memories, respectively [80]. This has led to the formula- 
tion of biophysically realistic network models of associative memory based on 



P. Erdi et al. (Eds.): Cortical Dynamics, LNCS 3146, pp. 1-23, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 




2 



Bruce P. Graham 



the architecture and operation of these hippocampal areas [55,82]. A number of 
factors must be considered when moving from an ANN model to a biophysical 
model, including: 

— how are patterns of information coded by neuronal spikes? 

— what, if anything, constitutes a rhythmic clock cycle? 

— how are storage and recall modes separated in space and time? 

— what roles do the different neuronal types play? 

The work described here makes the premise that gamma frequency rhythms 
(30-80Hz) may constitute a basic clock cycle [9, 45, 55] . A slower theta frequency 
rhythm (5-12Hz) is superimposed on this clock cycle and controls phasing of 
storage and recall [23,26,82]. This is based on the hippocampal activity seen 
in rats exploring a novel environment, absorbing and storing new spatial infor- 
mation [60]. These models do not attempt to include all behavioural states in 
rats, such as associated with sharp waves [7], nor necessarily any states found 
in other animals, particularly primates. Nonetheless, they provide explicit bio- 
physical formulations of associative memory operation and provide an excellent 
viewpoint from which to try to understand neuronal cellular and microcircuit 
functioning in a broader context. 

2 The Hippocampal Microcircuit 

The dominant cell type in many areas of neocortex and hippocampus is the pyra- 
midal cell (PC). The outputs from these cells are excitatory and the networks 
they form are likely to be the principal information processing structures in these 
brain regions. When ANNs are considered as models of the brain they are typi- 
cally being equated with networks of PCs. Pyramidal cells, both in hippocampus 
and in neocortex, are surrounded by a variety of inhibitory interneurons (INs). 
These INs differ in morphology, pharmacology and connectivity [17,46,52]. Un- 
derstanding their functional roles is a great challenge. ANN models usually con- 
tain only a single cell type (PCs) within a simple network architecture. Many of 
the details of the microcircuitry involving pyramidal cells and these interneurons 
is now known for the CA1 and CA3 regions of the hippocampus [17]. The basic 
hippocampal microcircuit is shown in Figure 1. 

The first feature to note is the spatial segregation of excitatory input from 
different pathways onto a PC [34]. In CA1, the Schaffer collateral input from 
CA3 PCs is largely to stratum radiatum (SR), constituting the proximal region 
of the apical dendritic tree. Recurrent collaterals from other CA1 PCs synapse 
on the basal dendritic tree (stratum oriens: SO). Perforant path input from 
layer III of entorlrinal cortex (EC) reaches the distal part of the apical dendritic 
tree (stratum lacunosum-moleculare: SL-M). In region CA3, input to stratum 
radiatum and stratum oriens is largely from other CA3 PCs. Input to the distal 
apical tree comes from layer II of entorlrinal cortex. A third excitatory input in 
CA3 comes from granule cells of the hippocampal dentate gyrus which form the 
mossy fibre synapses in the very proximal region of the apical tree. 
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Fig. 1. Microcircuit architecture of INs surrounding a PC in the CA1 or CA3 region 
of the hippocampus. Small triangles are excitatory synapses and small circles are in- 
hibitory synapses. Adapted from Paulsen and Moser [62]. See text for details. 



Though a complete catalogue of interneuronal types remains to be deter- 
mined, several classes can be distinguished on anatomical and pharmacological 
grounds [17,46,52]. These include basket cells (BC), bistratified cells (BSC), 
axo-axonic (chandelier) cells (AAC) and oriens lacunosum-moleculare (horizon- 
tal) cells(O-LM). These INs are all inhibitory GABAergic cells. As illustrated 
in Fig. 1, like excitatory afferents, different IN types target specific spatial re- 
gions on PCs. They also receive excitatory input from particular pathways and 
may form synaptic (inhibitory) and gap junction (excitatory) connections with 
other INs. Other cell classes include horizontal and radial trilaminar cells and 
INs that only synapse onto other INs [17]. A subclass of horizontal trilaminar 
cells (HTC) send axon collaterals out of the hippocampus to the medial septum 
(MS). There may also be inhibitory projections between hippocampal subfields 
(CA1 to CA3). 

In addition to targetting specific dendritic localities on PCs, different classes 
of IN also exhibit a specific spread in their network connectivity across cells. 
Typically, an IN makes synaptic connections within a defined local area, unlike 
PCs that make widespread connections. Total IN numbers may be only 10% of 
the total cell population, with a single PC innervating hundreds of INs and an 
IN innervating several thousand PCs [17]. The spread of IN connections may be 
highly focussed. For example, a single O-LM cell makes connections in the distal 
dendritic tree of PCs with a spread that exactly matches the spread of excitatory 
input from a single entorhinal cell onto the same PC dendritic location [17]. 
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The nature of the inhibitory connections also differs with IN class. Basket and 
axo-axonic cells make several powerful connections onto the perisomatic and 
axon initial segment regions of a single PC, respectively, indicating the ability to 
strongly influence spike generation and output in the PC. In contrast, bistratified 
cells make more diffuse connections, with no more than one connection on a 
particular dendritic branch [17]. 

There is not a one-to-one correspondence between morphology, pharmacol- 
ogy and electrical activity in INs, making classification very difficult [17,46, 52]. 
Basket cells, in particular, include at least two types that are distinguishable on 
pharmacological grounds and appear to have distinct functional roles within the 
hippcampal microcircuit [16]. In the associative memory models to be described 
later in this paper, the basket cells are likely the parvalbumin (PV)-containing 
cells. Recent data indicates that certain cell types may be distinguished by their 
firing patterns in different brain states [41]. 

What does the microcircuit do? Most well studied, both experimentally and 
with mathematical modelling, is the contribution of the microcircuit to generat- 
ing and stabilising oscillatory activity at different frequencies [78]. Though the 
mechanisms that produce oscillations are not the focus here, two different oscil- 
lations, the theta and gamma rhythms, play an important part in certain models 
of associative memory function in the hippocampus. The precise mechanisms un- 
derlying theta (5-12Hz) are complex and include extra- and intralrippocampal 
sources [8,61]. Gamma (30-80Hz) is strongly determined by the intrinsic cellu- 
lar and synaptic properties of the hippocampal microcircuit [83] and may be 
controlled by external inputs to the hippocampus [13]. Here we are concerned 
with the possible functioning of the microcircuit as an information processing 
construct. In the final sections of this paper we will consider the possible role 
of this microcircuit in controlling storage and recall within associative memory 
networks. 

3 Signal Processing in Neurons 

Neurons integrate synaptic input from other neurons for the end purpose of 
producing their own output signals for propagation to receiving neurons. The 
synaptic input also results in signals that are internal and localised within a 
neuron and determine synaptic plasticity. The integration of synaptic input in- 
volves the spatial and temporal summation of signals from different synapses and 
the interaction of these summed signals with the intrinsic membrane properties 
of the neuron. These properties are determined by the typically heterogeneous 
distribution of different types of ion channels throughout the dendritic tree. 

3.1 Intrinsic Properties of Pyramidal Cells 

Increasingly detailed knowledge is being obtained about the spatial distribution 
of ion channels in different neuronal types, as recently reviewed in Migliore and 
Shepherd [59]. CA1 pyramidal cells are amongst the most well characterised in 
this regard. Sodium and calcium channels maintain a relatively uniform density 
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out into the apical dendritic tree, though sodium channel characteristics and 
the type of calcium channel may change with distance [38, 50] . The fast A-type 
potassium channel and the mixed cation H channel increase in density roughly 
6-fold over several hundred micrometers distally in the apical tree [30,47]. 

Volt age- activated ion channels are characterised by the membrane voltage 
range over which they open (activate), and the time course of their opening and 
closing. Certain channel types will remain open while the membrane voltage 
is within the appropriate range. Other channels inactivate with a certain time 
course, that is they will close some time after they have opened, even though 
the membrane voltage may still be within the range at which these channels first 
open. Many ion channels activate when a cell is depolarised, that is, when the 
membrane potential moves towards zero from rest. Such channels include sodium, 
calcium, delayed rectifier (DR) and A-type (A) potassium, and the mixed cation 
M channels. Other types, such as the H channel are activated at lryperpolarised 
potentials. Sodium, calcium and the DR and A potassium channels activate 
quickly with millisecond kinetics. The M and H channels activate with time 
courses in the tens of milliseconds. These channel types will be more extensively 
discussed than others in what follows. Slower channels, such as the calcium- 
activated slow-after-hyperpolarizing-potential potassium channel (AHP), have 
activation dynamics in the range of hundreds of milliseconds to seconds, and 
contribute to spike frequency adaptation. 

Functionally, these voltage-activated ion channels may be classified as ampli- 
fiers or suppressors of changes in membrane voltage. Ion channels whose reversal 
potential and activation range are in the same direction as a change in mem- 
brane potential away from rest act as amplifiers. The current generated by the 
opening of these channels will move the voltage further from rest. Sodium and 
calcium channels are amplifiers. In contrast, various potassium and mixed cation 
(M and H) channels are suppressors of voltage change. Movement of the mem- 
brane potential away from rest into the activation range of these channels is in 
the opposite direction to their reversal potential. Consequently the current gen- 
erated by the opening of these channels will tend to nullify the original change in 
potential. The action potential (AP) is the classic example of the interaction of 
amplification by sodium channels and suppression by potassium channels. Fast 
channels such as these, with dynamics in the millisecond range, are responsible 
both for generating action potentials and shaping synaptic voltage responses 
(EPSPs) as they travel through the dendritic tree to the soma. Slightly slower 
channels, such as M and H, that are suppressors of voltage change, act as high 
pass filters and contribute to electrical resonance in neurons, as will be described 
below. 

A detailed exposition of the different ion channel types found in hippocampal 
pyramidal cells and their dynamic characteristics is given by Borg-Gralram [6] . 

3.2 Signal Integration 

The dynamic characteristics of different ion channels and their spatial distribu- 
tion within dendritic trees gives them specific functional roles for synaptic signal 
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integration. Ion channels contribute to the time course and summation of synap- 
tic input. Here we consider work that addresses how excitatory inputs that are 
widely distibuted across the apical dendritic tree of PCs summate to affect cell 
output in the soma. Do such inputs sum linearly? Will distant inputs have as 
much influence on cell output as those close to the cell body? 

Synaptic Scaling. One consequence of synapses being spatially distributed 
across a dendritic tree is that those synapses that are more distant from the 
cell body may have less impact on the voltage response at the soma than more 
proximal synapses, due to membrane current leakage as signals travel through 
the dendrites. Either distal synapses need to produce larger local EPSPs or sig- 
nals need to be amplified as they travel along the dendrites to overcome this 
disadvantage. There is experimental evidence that the synaptic AMPA conduc- 
tance does increase with the distance of the synapse from the soma in the apical 
dendrites of CA1 pyramidal cells [2,49]. 

Temporal Summation. The rising density with distance from the soma of A 
and H channels in the apical dendritic tree of CA1 PCs may in part determine 
the temporal summation of synaptic inputs. In a series of experimental and 
modelling studies, Magee [47,48] has demonstrated that deactivation of the H 
current may act to shorten the time course of distal EPSPs. This has the effect 
that the temporal summation of trains of EPSPs is independent of their spatial 
location within the dendritic tree. 

In a modelling study, Migliore [57] has demonstrated that activation of the 
A current and deactivation of the H current are instrumental in restricting the 
temporal integration of distal and proximal inputs to a time window of around 
20msecs within which the distal input precedes the proximal input. 

Spatial Summation. For a pyramidal cell to produce an output it typically 
needs to receive a number of contemporaneous inputs. These inputs are likely 
to be spatially distributed across a portion of the dendritic tree. Their spatial 
locations, in combination with the local membrane characteristics, will determine 
how the different inputs summate. 

Various functional scenarios depend on the mathematical form of input sum- 
mation. In a study of pattern recognition by CA1 PCs, Graham [19] considered a 
situation in which a pyramidal cell needed to be able to accurately distinguish the 
number of simultaneous excitatory inputs arriving at random locations within 
the stratum radiatum portion of the apical dendritic tree. The amplitude of the 
voltage response in the soma was used as the criterion for measuring the number 
of inputs. Different spatial distributions of the same number of inputs produced 
slightly different voltage amplitudes, introducing noise into the measurement 
that limited the discrimination that could be made. 

Different characteristics of the synaptic input and the membrane properties 
of the dendritic tree were explored. The basic case consisted of all inputs (in the 
form of single APs) arriving at exactly the same time onto a dendritic tree con- 
taining only a leak conductance (passive membrane). A comparison was made 
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between the distributions of voltage amplitudes for 200 compared with 100 in- 
puts to synapses at different random spatial locations. Amplifying mechanisms at 
each synapse and in the dendritic membrane that boosted distal inputs all acted 
to improve the signal-to-noise ratio between the amplitude distributions of 200 
and 100 inputs. This equates with an improvement in the cell’s discrimination 
of the number of simultaneuous inputs reaching its dendritic tree. The amplify- 
ing mechanisms included (1) scaling of the synaptic AMPA conductance so that 
the EPSP amplitude at the soma of a single input was independent of synaptic 
location, (2) an NMDA component of the synaptic EPSP, (3) a uniform distribu- 
tion of persistent (noninactivating) sodium channels in the dendritic membrane, 
and (4) a uniform distribution of low-voltage-activated calcium channels. These 
mechanisms improved input discrimination when included individually and in 
combination. 

Two extra sources of noise were included. Firstly, rather than the APs ar- 
riving synchronously at all the synapses, the arrival times were uniformly dis- 
tributed across a short interval of 20msecs. Secondly, a random variation in the 
maximum synaptic conductance was added to each synapse to simulate quantal 
variance. As might be expected, discrimination ability was reduced by quantal 
variance. Intriguingly, the temporal variance of arrival times actually increased 
discrimination ability. This was presumably due to a reduction in nonlinear sum- 
mation at nearby synapses and a randomisation of EPSP arrival times at the 
soma. 

Linear and Nonlinear Summation. Using a very detailed model of a CA1 
pyramidal cell, Poirazi et al. [64, 65] investigated the impact of the relative spa- 
tial location of synapses on the summation of their EPSPs. They considered the 
cellular response to active synapses clustered locally on a dendritic branch, with 
a number of clusters on different branches. The computer simulations demon- 
strated that the clustered inputs on a single branch summed nonlinearly due to 
amplification by NMDA, sodium and calcium currents. The peak voltage output 
from a dendritic branch was a sigmoidal function of the number of active inputs 
on that branch. However, the voltage signals propagating from separate branches 
summed linearly in the trunk of the apical dendritic tree due to rectification by 
the A current. Thus they characterised the pyramidal cell as a two-layer network 
in which the input at the soma consisted of the linear sum of a set of sigmoidal 
units, corresponding to the dendritic branches. 

3.3 Resonance 

The dynamics of neuronal membrane and ion channels causes electrical resonance 
in neurons. The membrane capacitance and resistance (or leak conductance) pro- 
vide low-pass filtering of electrical signals. This is most clearly seen if a neuron 
is driven by a subthreshold oscillatory current. The amplitude of the voltage 
response gradually decreases as the current frequency increases, for the same 
current amplitude. In contrast, relatively slowly activating ion channels that act 
as suppressors of voltage change provide high-pass filtering. These channels do 
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not activate sufficiently fast to attenuate high frequency changes in membrane 
potential, but will act to suppress lower frequency changes. This combination of 
low- and high-pass filtering results in resonance. For stimulation by a subthresh- 
old oscillating current of some fixed amplitude, the neuronal voltage response 
will have a maximum amplitude at some frequency of oscillation intermediate 
to the range of low- and high-frequency attenuation. Ion channel currents that 
amplify voltage changes can provide an amplification of the resonance peak. An 
introduction to neuronal resonance is provided by Hutcheon and Yarom [33] . 

A recent experimental and modelling study has demonstrated resonance in 
CA1 PCs [32]. Resonance at both depolarised and lryperpolarised potentials 
was evident, with a resonance frequency of around 7Hz in both cases. This 
corresponds to the theta frequency range. At depolarised potentials, high-pass 
filtering resulted from activation of the M-current, with the resonance peak being 
amplified by a persistent sodium current. At lryperpolarised potentials, high-pass 
filtering was provided by the H-current and little amplification was evident. 

Different neuronal types have different resonant characteristics. In hippocam- 
pus, horizontal cells have a similar resonance frequency to PCs, while fast spiking 
interneurons resonate in the beta-gamma range of 10-50Hz [63]. This is indicative 
of different ion channel distributions in different cell types. Subthreshold reso- 
nance also translates into the spiking regime. An oscillating current amplitude 
that does not normally produce spiking activity will result in neuronal spiking 
around the resonant frequency. 

The functional implications of intrinsic neuronal resonance are not yet clear. 
One possibility is that it acts to accentuate and stabilise network oscillations, 
particularly at theta frequencies. 

Multiple Roles for the H-Current. The lryperpolarisation-activated mixed 
cation H-current has potentially multiple roles in regulating neuronal activity 
[69]. The work of Magee [47, 48] and Migliore [57] demonstrates that deactivation 
of the high density of H channels in the apical dendritic tree of PCs can act to 
shorten the time course of distal EPSPs and so normalise temporal summation 
of inputs. 

In contrast, activation of the H-current results in theta frequency resonance. 
The same effect that produces resonance can also result in PC spiking follow- 
ing lryperpolarising inhibition due to rebound depolarisation through H-current 
activation. The H-current thus plays a role in entraining PC bring to rhythmic 
network inhibition and promoting network oscillations at theta frequency [11]. 

3.4 Synaptic Plasticity 

Internal Signals. As well as determining the spiking output of a neuron, synap- 
tic input, in combination with the intrinsic membrane properties, determines 
voltage and calcium signals that are internal to the neuron. Such signals play a 
role in synaptic and cellular plasticity. 

The standard scenario of Hebbian learning requires a measurement of both 
pre- and postsynaptic activity at a synapse to drive plasticity. The postsynaptic 
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activity may be signalled by back-propagating action potentials (BPAPs), which 
in turn affect the calcium concentration at a synapse. The calcium concentration 
determined by both pre- and postsynaptic activity may be the signal which is 
most directly related to the magnitude and direction of synaptic change. This 
standard scenario is considerably complicated by the possibility of active pro- 
cesses contributing to local dendritic calcium spikes that are not associated with 
BPAPs. Goldberg et al. [18] discuss the consequences of such spikes for cellular 
plasticity. 

Action potentials originating at the soma, or axon initial segment, may be 
actively propagated back into the dendritic trees of pyramidal cells [75, 76]. This 
is due to the high density of sodium channels found in at least the main trunk of 
the apical dendritic tree. The rising density of A-channels with distance from the 
soma, and sodium channel inactivation kinetics, result in an attenuation in back- 
propagating action potential (BPAP) amplitude with distance [30,56,58]. Block 
of the A-channels results in a significant reduction in the BPAP attenuation 
and may allow the emergence of slow calcium spikes in the dendrites [30] . These 
phenomena are neatly summarised in the online supplement to Poirazi et al. [64, 
65], whose model is able to reproduce the effects. 

Mechanisms that alter BPAP amplitude and propagation will also affect 
synaptic plasticity [74]. Thus the A-current in the apical dendrites provides one 
such mechanism for regulating the signals underlying synaptic plasticity. Switch- 
ing off the A-current effectively switches on plasticity. Appropriately timed 
synaptic input that precedes a BPAP achieves this by inactivating the A-current 
for the time window of BPAP propagation [58]. The A-current may also be 
reduced by neuromodulators such as acetylcholine, which acts to shift the A- 
channel activation kinetics to more depolarised potentials [29]. 

Models of Synaptic Plasticity. A number of recent models [40,71,70] have 
attempted to capture the basic interaction between internal and external signals 
in driving synaptic plasticity. These models seek to reproduce data demonstrat- 
ing spike-timing-dependent plasticity (STDP) [5] . If postsynaptic activity occurs 
within a small time window of around 20msecs following presynaptic activity, 
then the synapse increases in strength (long-term potentiation: LTP). If, how- 
ever, the postsynaptic activity precedes the presynaptic activity within a similar 
temporal window then a reduction in synaptic strength (long-term depression: 
LTD) occurs. 

In these models, synaptic strength is altered as a function of calcium entry 
[40,71] or the rate of voltage change at a synapse [70]. Calcium entry occurs 
through NMDA channels and voltage-gated calcium channels. A BPAP with an 
after-depolarising potential is required to reproduce the temporal characteristics 
of STDP. Saudargiene et al. [70] investigated the significance of BPAP shape and 
demonstrated that fast BPAPs, as seen in the proximal apical dendritic tree, 
produce STDP with both LTP and LTD components. However, slower BPAPS, 
as seen the the distal tree, result in only LTP during the time window of STDP. 
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3.5 Neuromodulation and Metaplasticity 

As apparent from the discussion above, signal integration and generation in 
pyramidal cells, and other neuronal types, is a function of external input and 
the intrinsic membrane properties of the neuron. Any alteration in synaptic and 
extrasynaptic membrane properties will change the input-output functionality 
and plasticity of a neuron. Pyramidal cells in the hippocampus receive a wide 
range of so-called neuromodulatory inputs that alter ion channel properties, 
rather than contribute directly to voltage signals. Neuromodulators alter the 
membrane characteristics by blocking or enhancing specific ion channels and by 
altering their dynamics. Such changes may cover the entire cell, or be directed 
to specific spatial locations, such as the pre- or postsynaptic membrane. The 
end result is a change in the voltage response of the neuron to synaptic input. 
Such neuromodulators include acetylcholine, dopamine and serotonin, amongst 
others. 

Modulatory changes may be directed at synaptic plasticity. As indicated 
by the STDP models, synaptic calcium levels [85] are a major determinant of 
synaptic plasticity. Alterations in the calcium response to voltage signals, or the 
relationship between calcium level and changes in synaptic strength will modify 
the learning rule employed by the neuron. This is known as metaplasticity and 
can be effected by neuromodulators and by the history of activity at a synapse 
[1,10,67]. For example, acetylcholine may reduce the A current in CA1 PC 
dendrites, allowing large amplitude BPAPs to reach much of the dendritic tree 
[29], potentially enhancing calcium entry and thus synaptic plasticity. 

Acetylcholine has been given a central role in biophysical models of associa- 
tive memory [22,21,55]. Pyramidal cells and interneurons in the hippocampus 
are subject to multiple effects from acetylcholine via two broad classes of acetyl- 
choline receptors (muscarinic and nicotinic) , that have distinct spatial locations 
and dynamics [66] . The effects of acetylcholine include reducing various intrinsic 
potassium currents, including the A current, and high-tlrreshold calcium cur- 
rents, whilst increasing low-threshold calcium currents [55]. Presynaptic recep- 
tors result in presynaptic inhibition of transmitter release at certain pathway- 
specific glutamatergic and GABAergic terminals [27,28,16]. NMDA currents 
may be increased [66]. These effects combine to significantly alter cellular activ- 
ity and plasticity, and overall network dynamics. The possible functional effects 
for associative memory are considered in the following section. A review of com- 
putational modelling of various forms of neuromodulation has been conducted 
by Fellous and Linster [12]. 



3.6 Summary 

This section has covered a range of work dedicated to understanding the signal 
integration properties of individual neurons, particularly hippocampal pyrami- 
dal cells. Synaptic input impinging on the PC dendritic tree is filtered by the 
intrinsic membrane properties of the dendrites. The resultant voltage signals 
determine the cell’s spiking output as well as synaptic and cellular plasticity. 
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Amplification and suppression of voltage changes by the heterogeneous popula- 
tion of voltage-gated ion channels in the PC membrane clearly modify the effect 
of synaptic input on cell output. Interneurons also exhibit particular populations 
of ion channels that contribute to their distinctive firing patterns [46,52], and 
presumably their functional roles. 



4 Associative Memory Neural Networks 

4.1 Associative Memory 

The associative memory paradigm involves the storage of information so that 
it is content-addressable rather than index addressable. That is, a partial or 
noisy cue pattern is used to recall a pattern stored in the memory, rather than 
the index, or address, of where the pattern is stored. Such a memory is called 
autoass ociative if the cue pattern results in the completion of that pattern during 
recall. The memory is heteroass ociative if a cue results in a different pattern being 
recalled with which it was associated during pattern storage. 

Neural networks can act as associative memories in which memory patterns 
are represented by neuronal activity. Storage takes place via some form of Heb- 
bian synaptic learning rule. A basic rule that strengthens the connection between 
two neurons that are coactive (like LTP) is sufficient. A rule that includes synap- 
tic LTD when pre- and postsynaptic activity is anticorrelated increases memory 
capacity [84]. During recall, the initial network activity determined by a cue 
pattern will result in those neurons that belong to the relevant stored pattern 
receiving, on average, the strongest synaptic input. By the setting of a suit- 
able threshold, through network inhibition, only those neurons belonging to the 
pattern should remain active, completing the recall process. 

A network containing recurrent connections between a single pool of neurons 
can act as an autoassociative memory. A pattern of neural activity is associated 
with itself by the strengthening of the recurrent connections between the active 
neurons. Such a network can also be used to heteroassociatively store a sequence 
of patterns. If the sequence is presented to the network in a temporal order, with 
some short time between each pattern, then a temporal Hebbian learning rule 
can associate the activity of one pattern with the next pattern in the sequence. 
A two-layer network in which one (input) layer of neurons sends connections to a 
second (output) layer can act as an heteroassociative memory. Hebbian plasticity 
of these feedforward connections can associate activity in the input layer with 
activity in the output layer. 

Biophysical Models. In this section we consider models that examine the im- 
plementation of associative memory function in biophysical networks of spiking 
neurons. The models are based on regions CA1 and CA3 of the mammalian hip- 
pocampus. They treat CA3 as an autoassociative memory or heteroassociative 
sequence learner. The CA3 to CA1 pathway is considered as an heteroassocia- 
tive memory. Similar functional subdivisions of the hippocampus have a long 
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history, dating back to the work of David Marr [44,51,53,80]. Attempts have 
been made to specify the nature of the information stored in each hippocampal 
stage, including the dentate gyrus, and how the information is transformed from 
stage-to-stage [44, 80] . We are more concerned here with the precise workings of 
individual stages, rather than the hippocampus as a whole. 

In this construction [27, 28] certain synaptic pathways are modifiable and 
form the site of memory formation. Other pathways provide an external synap- 
tic drive that selects certain PCs to be active. These pathways determine the 
patterns of PC activity to be stored associatively. The mossy fibre input from 
dentate gyrus granule cells onto the proximal dendrites of CA3 pyramidal cells 
acts to select which CA3 PCs are active for a particular pattern to be stored in 
the CA3 autoassociative (or sequence) memory. Synaptic modification by Heb- 
bian learning of the recurrent connections between CA3 PCs stores this pattern. 
A pattern of activity in layer II of the entorhinal cortex is heteroassociated with 
a CA3 activity pattern by Hebbian learning at the perforant path synapses in 
the CA3 PC distal dendrites. Input from entorhinal cortex subsequently can act 
as a recall cue for the CA3 autoassociative memory. Similarly, in CA1, Heb- 
bian learning of the Schaffer collateral input from CA3 PCs onto the CA1 PCs 
heteroassociates a CA3 pattern with a CA1 pattern. The active PCs in a CA1 
pattern are selected by perforant path input from layer III of entorhinal cortex 
onto the PC distal dendrites. Subsequently, a recalled pattern in CA3 acts as a 
cue for pattern recall in CA1. 

Binary patterns are represented by the synchronous emission of single action 
potentials by the relevant pyramidal cells. These active cells can be equated 
with a binary one, whereas the silent PCs correspond to a binary zero. Gamma 
frequency (around 40Hz) network activity constitutes the basic clock cycle [9, 45, 
55], so that a pattern is represented by the spiking activity on a single gamma 
cycle. Recent experimental data indicates widespread synchronous PC activity 
on this time scale [20], supporting this construction. The slower theta rhythm 
(around 5Hz) is used as the frequency of pattern presentation for storage and 
pattern cuing for recall [55,82]. Storage and recall may take place on opposite 
phases of a single theta cycle [23,26]. 

4.2 An Autoassociative Memory Model 

Menschik and Finkel [55] implemented an autoassociative memory network model 
based on the CA3 region of the mammalian hippocampus. In this model, patterns 
are represented by the spiking activity of a set of 64 pyramidal cells. The recall 
dynamics of the PCs is controlled by 8 basket cells (BCs) and 64 axo-axonic cells 
(AACs; chandelier cells), giving a network of 136 neurons in total. The network 
architecture consists of recurrent excitatory connections between the PCs; BCs 
receive excitatory connections from all PCs and send inhibitory connections to 
each other and back to the perisomatic region of the PCs; each AAC receives an 
excitatory connection from one PC and sends an inhibitory input back to the 
axon initial segment of that same PC. Excitatory connections are modelled as 
AMPA and NMDA synapses; inhibitory connections are GABA^ synapses. The 
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relatively detailed compartmental models of Traub and colleagues [77, 79] are 
used for both the PCs (66 compartments) and interneurons (51 compartments). 
Synapses are modelled as simple single or dual exponentials, with no voltage 
dependence for the NMD A current. 

Only the dynamics of pattern recall is examined with the model. The bas- 
ket cell network provides gamma-band synchronized oscillations which form the 
clock cycle of recall. Theta-band GAB As-mediated inhibition from the medial 
septum is modelled as a 6Hz square- wave current injection in the soma of the 
basket cells. Thus the BCs are inhibited (hyperpolarised) on one phase of theta, 
and excited (depolarised) on the opposite phase. This allows one half of a theta 
cycle for pattern recall by PC activity before BC inhibition becomes sufficiently 
strong to silence PCs and thus reset the network for the next cue pattern. Each 
new recall cue is presented at the start of a theta cycle in the form of entorlrinal 
input to AMPA and NMDA synapses in the distal dendrites of those PCs that 
are to be active. 

Pattern storage takes place by using the connection weights of a Hopfield 
artificial neural network autoassociative memory [31] to scale synaptic conduc- 
tances in the biophysical network model. Positive weights scale the maximum 
conductances of the AMPA and NMDA receptors at the PC recurrent collat- 
eral synapses. Negative weights scale the maximum GABA^ receptor-mediated 
conductances at the AAC to PC synapses. For the results presented in [55], 5 
random 64-bit binary strings were stored in the Hopfield net to produce the 
connection weights. 

Recall performance was tested for various levels of cholinergic modulation of 
the network. Cholinergic input to PCs and BCs is modelled as a constant con- 
centration that reduces the maximum conductance of intrinsic membrane AHP 
and calcium (Ca) currents, and reduces the amplitude of AMPA and NMDA- 
mediated EPSCs from recurrent collaterals. Diffuse cholinergic depolarization of 
PCs and BCs is modelled by a constant somatic current injection (which sets 
the DC level of the theta-band square wave injection to the BCs). Increasing 
cholinergic input switches pyramidal cells from bursting to spiking mode due 
to a decrease in the AHP and Ca currents that underly bursting. The spiking 
mode is used for associative memory recall. Menschik and Finkel [55] hypothesize 
that PC bursting mode is suitable for inducing the synaptic plasticity underly- 
ing learning. Calcium entry in the dendrites is far higher during burst firing, 
compared to regular spiking. As mentioned earlier, calcium levels may be the 
dominant signal driving synaptic plasticity. This distinction between storage and 
recall modes has similarities to Buzsaki’s two stage memory model [7]. Menschik 
and Finkel’s role for cholinergic modulation differs from Hasselmo’s proposal 
(see below) that the decrease in recurrent excitation induced by acetylcholine is 
ideal for the learning phase of associative memory [22,21]. 

Computer simulation of the model demonstrated that even highly noisy cues 
provide accurate retrieval of a stored pattern within half a theta cycle (around 
9 gamma cycles or recall steps, given a high gamma frequency of around 100Hz) 
under conditions of normal cholinergic modulation. Decreasing cholinergic input, 
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as in Alzheimer’s disease, slows the gamma frequency and increases the strength 
of recurrent excitation. Gamma frequency is slowed due to a lowering of cell 
excitability and an increase in the AHP and Ca currents that leads to an increase 
in interspike intervals. Both effects result in recall performance deteriorating due 
to a reduction in the number of gamma cycles available for recall and interference 
from other stored patterns through the strong recurrent connections. 



4.3 Phasing Storage and Recall 

The autoassociative memory model of [55] only considers network dynamics 
during recall of stored patterns. A complete neural model of associative memory 
should include the dynamics of both storage and recall and mechanisms for 
switching between these phases of operation. 

The dynamics of recall requires the suppression of synaptic plasticity or a 
weight control mechanism that stops excessive changes in synaptic strengths 
during the recall of previously stored patterns. Principal cell activity must also 
be suitably threslrolded by inhibitory interneurons so that the cells belonging 
to the stored pattern, which are also receiving the strongest excitation, are the 
only ones to become active. 

Storage, on the other hand, requires synapses to be plastic, but excitation in 
the plastic pathway should not contribute to principal cell activity. For example, 
if the recurrent connections between principal cells in an autoassociative memory 
were allowed to contribute to excitation during storage, then previously stored 
patterns would become confused with new patterns, whose representation is 
driven by external activity. 

The role of cholinergic neuromodulation in setting appropriate conditions 
for pattern storage has been explored in a series of models by Hasselmo and 
colleagues [22,21,27,28]. In experimental work with rat piriform cortex and 
hippocampus, they have demonstrated that cholinergic input selectively sup- 
presses layer-specific or recurrent excitation, while promoting plasticity in the 
same pathway [4,24,27,28]. Using rate coded, as opposed to spiking, models of 
CA1 [27] and CA3 [28], they have explored how these cholinergic effects improve 
storage in associative memories. 

These same models [27, 28] also explore how feedback regulation of choliner- 
gic input can effectively switch the memory network between storage and recall 
modes. In the hippocampus, the major source of cholinergic input is from the 
medial septum. Specific interneurons in CA1 and CA3, that are excited by pyra- 
midal cells, send long-range inhibitory connections to the medial septum [17]. 
In the models, excitatory drive to the pyramidal cell network by a novel pattern 
on the pattern-determining pathway results in low activity in the PC network. 
Consequently, inhibitory feedback to the medial septum is low and cholinergic 
input from the septum is high, setting conditions for pattern storage. In contrast, 
excitation (cueing) from a previously stored pattern leads to high PC activity as 
recall proceeds. Feedback inhibition to the septum then also increases, reducing 
cholinergic input and promoting the conditions required for pattern recall. 
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These rate coded models do not make use of gamma or theta rhythms to 
clock storage and recall. Further, the phasing between storage and recall by reg- 
ulation of cholinergic input is relatively slow, in accord with the time constant 
of seconds for the neuronal effects of acetylcholine via muscarinic receptors [25, 
26]. However, assuming a relatively constant level of acetylcholine, rapid phasing 
between storage and recall within a theta cycle is possible via GABAs mediated 
network effects [25, 23]. In this scenario, the waxing and waning of GABAs inhi- 
bition provides a concurrent waxing and waning in the strength of layer-specific 
or recurrent excitatory connections via presynaptic inhibition. This provides con- 
ditions suitable for pattern storage or pattern recall on opposite phases of a theta 
cycle. This is explored in a spiking model of sequence memory [82], described 
below. 

4.4 A Model of Sequence Storage and Recall 

Wallenstein and Hasselmo [82] developed an associative memory model of se- 
quence storage and recall using spiking neurons. Their network contains 1000 
pyramidal cells (PCs) and 200 interneurons (INs) in a spatial grid with con- 
nectivity determined by Gaussian probabilities over distance, such that PCs 
connect with 15% of PCs and 20% of INs. INs contact 30% of PCs and 20% of 
INs. The INs are putative basket cells, and no other IN type is explicitly rep- 
resented. PCs and INs are represented by five-compartment multi-ion-clrannel 
models. PCs contain a range of ion channels (fast Na, KDR, Ca, KAHP, KCa, 
KA), whilst INs contain only fast Na and K channels. Both cell types receive 
synaptic AMPA, NMDA, GABA^ and GABA# input. PCs receive recurrent 
excitation from other PCs and slow GABAs inhibition on apical dendrites, with 
perisomatic fast GABA^ inhibition from the INs. INs receive excitation on the 
soma from PCs, with fast inhibition on soma and proximal dendrites and slow 
inhibition on distal dendrites from other INs. All synapses are assumed to have 
presynaptic GABAs receptors. During network activity, an estimate of external 
GABA concentration at each cell is made over time and used to downregulate 
all synaptic connections to implement the effect of these receptors. 

Theta oscillations are induced by rhythmic GABAergic input from the medial 
septum expressed as IPSPs on the somata and proximal dendrites of INs. Septal 
cholinergic input is modelled as a sustained steady-state excitation due to a 
partial reduction in leak current in the distal dendrites of PCs and INs. This 
level of modulation is set to give random activation of around 15% of PCs and 
INs at any one time. Within a theta cycle, INs fire bursts of APs at gamma 
frequency, whilst PCs fire one or a few spikes at a particular phase of theta. 

Whereas Menschik and Finkel [55] considered the storage and recall of dis- 
crete patterns, Wallenstein and Hasselmo [82] use their network to store and 
recall a sequence of patterns. Each pattern within a sequence to be stored is 
delivered as entorhinal afferent AMPA input to the apical dendrites of selected 
PCs for 20msecs at the start of a theta cycle, with a new pattern delivered on 
each theta cycle (every lOOmsecs). A sequence is 30 patterns long and is delivered 
five times during learning. Learning is implemented by a temporal Hebbian rule 
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applied only to NMD A conductances on the recurrent collaterals of PCs. Before 
learning, the NMDA conductances are set to random values. During learning, 
changes in conductance are proportional to the product of presynaptic activity 
(average presynaptic firing rate over the last 50ms preceding postsynaptic ac- 
tivty) and postsynaptic activity (difference of postsynaptic membrane voltage 
above a threshold value of -30mV). This gives only LTP as changes are zero if 
the postsynaptic voltage is below threshold. This rule incorporates elements of 
STDP but is different in detail from the STDP models considered above. 

At the start of each theta oscillation the PCs belonging to a particular pat- 
tern in the sequence fire APs due to the external afferent input. In addition, 15% 
of other PCs fire randomly due to background excitation (cholinergic depolariza- 
tion) at different times during the theta cycle. Due to the 50msec time window 
of the temporal Hebbian learning rule, the connections between the active PCs 
of one pattern and the active PCs of the next and subsequent patterns in a se- 
quence are not strengthened, as each pattern is separated by lOOmsecs. Instead, 
connections between pattern PCs and randomly firing PCs are strengthened. 
After a number of learning trials certain non-pattern PCs start to fire gamma 
frequency trains of APs that span in time several of the sequence patterns. These 
context sensitive cells are the glue that binds the sequence together (see Levy 
[43] for a different model of how context sensitive cells may form) . 

Presynaptic inhibition via GABAs receptors is vital to the accurate storage 
and recall of patterns in this network. During the first half of a theta cycle, 
GABAs inhibition is high. This suppresses recurrent excitation so that PC firing 
strongly reflects entorhinal afferent input which specifies the current pattern in 
the sequence. Without this reduction in recurrent excitation, activity due to 
previously stored patterns occurs early in a theta cycle and interferes with the 
formation of context sensitive cells that encode the correct ordering of patterns in 
a sequence. Later in theta, GABAs decays, and PCs start firing due to recurrent 
excitation, reflecting stored patterns. In this second half of a theta cycle, the 
network effectively shifts from storage to recall modes. Afferent input that arrives 
in the latter half of theta acts as a recall cue. Recurrent excitation results in the 
rapid (gamma frequency) replay of the subsequent section (5 or so patterns) of 
the stored sequence over the remaining part of the theta cycle. 

The context sensitive cells can be likened to the so-called place cells found 
in rat hippocampus. If a rat is moving through an environment, then PCs firing 
early in a theta cycle reflect the rat’s current location, whereas later firing PCs 
are predicting where the rat is going. As the rat moves through the place field 
of a PC, that PC’s firing occurs earlier in the theta cycle. Such phase precession 
of spike firing is a characteristic of experimentally-recorded place cells [60] . This 
is analogous to the partial sequence recall following the appropriately timed 
presentation of a cue pattern in this model [82] . 

4.5 Other Models 

A number of other modelling studies have considered the tlreta/gamma rhythm 
associative memory operation of networks of spiking neurons. Lisman and Idiart 
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[45] introduced the idea of a short-term memory buffer in which memory patterns 
are represented by the activity on each gamma cycle. During each theta cycle 
around 7 of these patterns are replayed, with repetition on subsequent theta 
cycles. Jensen, Idiart and Lisman investigated how NMDA receptor dynamics 
could account for this short-term memory buffer, and also considered how short 
and long-term memory may interact for sequence storage and recall [36, 37, 35]. 
Sommers and Wennekers [72, 73] examined how stored patterns could be recalled 
within single gamma cycles, allowing a new pattern to be recalled on each such 
cycle. These models do not explicitly include features of the interneuronal mi- 
crocircuit. Fransen and Lansner [15] considered cortical columns as the basic 
representational units in a model of neocortical associative memory, with each 
column consisting of a small network of PCs and INs. 

Other studies have explored possible mechanisms for sequence storage and 
recall. Sequence storage as continuous attractors in an autoassociative memory 
and how that might relate to hippocampal place cells has been the subject of a 
number of studies [39, 68, 81]. Melamed et al. [54] discuss the interesting problem 
of encoding behavioural sequences that occur on arbitrary timescales and may 
need to be temporally compressed to fit within the theta/gamma rhythms. 



5 The Hippocampal Microcircuit Revisited 

Neural network models of associative memory need to account for a number of 
factors. During pattern storage network activity should only include the current 
pattern to be stored, which is likely determined by external input to the network. 
Activity resulting from previously learnt patterns interferes with the storage of 
a new pattern and degrades memory performance. Thus mechanisms that limit 
such activity are necessary. Pattern recall requires that intial network activity, 
as the result of an externally applied cue that is a partial or noisy version of a 
stored pattern, evolves towards and stabilises at the appropriate stored pattern. 
The ability to achieve this is dependent upon appropriate synaptic plasticity 
(learning) during storage. Nonetheless, the activity of individual PCs must be 
tightly controlled by a thresholding mechanism for recall to work. 

The models described in the previous section provide initial explorations into 
how networks of spiking neurons may act as associative memories. Storage and 
recall dynamics are determined by synaptic and intrinsic cellular properties and 
by alteration of these properties by neuromodulation with acetylcholine. Acetyl- 
choline and GAB As-mediated inhibition may serve to set appropriate conditions 
for pattern storage by reducing synaptic transmission whilst promoting plastic- 
ity on the modifiable pathways [22,21,82]. Patterns for storage and recall cues 
are provided by afferent input from the entorhinal cortex [55,82]. Basket cells 
provide the basic gamma rhythm clock cycle that is modulated at the slower 
theta rhythm. Cueing for storage and recall proceeds at the theta rhythm. PC 
thresholding during recall is provided by basket cells [82] with a contribution 
from axo-axonic cells [55]. No other interneuronal types are explicitly included 
in these models. 
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It is possible to ascribe functionality to more of the hippocampal microcir- 
cuit. Ideas concerning interneuronal network involvement in rhythm generation 
and control of PC networks are explored in Buzsaki and Chrobak [9]. Paulsen 
and Moser [62] consider how GABAergic interneurons might provide the control 
structures necessary for phasing storage and recall in the hippocampus. Building 
on their ideas, the following observations can be made concerning the roles for 
different interneuronal types. 

Basket Cells. BCs provide inhibition to the perisomatic region (on and around 
the cell body) of PCs and are driven by feedforward pathways from entorhinal 
cortex, mossy fibers (CA3) and Schaffer collaterals (CA1), and in a feedback 
pathway by recurrent PC collaterals. They may also inhibit each other and bis- 
tratified cells. Different classes of basket cells exist that may have very different 
functions [16]. A likely role for PV-containing BCs is threshold-setting for PC 
activity during pattern recall. Many models use BCs in this way [55, 72, 73, 82]. 
Typically in these models the PCs receive an identical threshold-setting inhibi- 
tion that is a function of average PC activity over the entire network [72, 73]. In 
reality each PC will receive different inhibition depending on their particular BC 
input and the afferent drive that those BCs are receiving. That afferent drive 
will be at best an estimate of overall PC activity on the basis of the sample 
of PCs that are driving those BCs. Thus the threshold signal across the whole 
network will be rather noisy, limiting recall accuracy. How limiting this noise 
actually is has not been investigated. The models considered here [55, 82] do use 
a population of BCs, but they do not rigorously test the memory capacity of 
their networks. 

In addition, models typically use identical model cells for particular cell types. 
Recent work has demonstrated that variation in inhibitory cell properties be- 
tween cells of the same type can significantly alter the effect of population inhi- 
bition on PC activity [3, 14]. 

Bistratified Cells. BSCs provide feedforward inhibition driven by Schaffer 
collaterals to CA1 PCs and feedback inhibition driven by recurrent collaterals to 
CA3 PCs. Unlike the the strong, focal inhibition of basket cells, BSC inhibition is 
diffusely distributed to apical and basal dendrites [17]. It seems ideally suited to 
controlling synaptic plasticity during pattern recall by reducing BPAP amplitude 
and calcium entry in the dendrites. 

Axo-axonic Cells. AACs provide feedforward inhibition, driven by entorhinal 
and hippocampal PC afferents, to the axon initial segment of PCs. In constrast 
to their threshold-setting role in [55], they could be involved in shutting off PC 
output during pattern storage, when such output may not be desirable, while 
not interrupting BPAPs that are involved in synaptic plasticity. Stopping PC 
output would also stop feedback inhibition to PCs via BCs and O-LM cells, 
thus additionally aiding BPAP propagation into the dendrites and maximising 
the influence of pattern-determining input from entorhinal cortex to the distal 
dendrites of PCs. 
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However, the effect of AAC inhibition may be more selective than a simple 
block of APs emitted by a PC. A simulation study on the interaction of inhibition 
and A-type potassium channels in CA3 axons indicates that a block may be 
selective for the large diameter Schaffer collaterals, whilst allowing transmission 
to the smaller recurrent collaterals [42], 

Oriens Lacunosum-Moleculare Cells. O-LM cells send focal feedback in- 
hibition onto the distal dendrites of PCs. This provides control over pattern- 
determining input from entorhinal cortex. Such pattern-determining input is 
necessary during storage, but may not be required during pattern recall. 

Horizontal Trilaminar Cells. Some HTCs send axon collaterals outside of 
the hippocampus, particularly to the medial septum [17]. They are driven by 
PC recurrent collaterals. Feedback inhibition to the medial septum that is a 
function of local PC activity may act to modulate acetylcholinergic input to the 
hippocampus and effectively switch CA1 and CA3 circuits between storage and 
recall modes [27]. 

6 Conclusions 

Neural circuitry is extremely complex and consequently it is very difficult to re- 
verse engineer it to determine its function. Associative memory provides a useful 
computational paradigm on which to base an attempt at reverse engineering the 
microcircuit of the mammalian hippocampus. Mathematical models and com- 
puter simulations that investigate aspects of the hippocampus at the single cell 
and network levels, within the context of associative memory, have been dis- 
cussed. These models are able to assign particular functions to aspects of the 
microcircuit. However, there is still a long way to go before we fully understand 
what experimental neuroscience has already told us about the cellular properties 
and circuit architecture of the hippocampus. 



References 

1. W.C. Abraham, S.E. Mason-Parker, M.F. Bear, S. Webb, and W.P. Tate. Heterosy- 
naptic metaplasticity in the hippocampus in vivo: a BCM-like modifiable threshold 
for LTP. Proc. Nat. Acad. Sci., 98:10924-10929, 2001. 

2. B.K. Andrasfalvy and J.C. Magee. Distance-dependent increase in anipa recep- 
tor number in the dendrites of adult hippocampal CA1 pyramidal neurons. J. 
Neurosci., 21:9151-9159, 2001. 

3. I. Aradi and I. Soltesz. Modulation of network behaviour by changes in variance 
in interneuronal properties. J. Physiol . , 538:227-251, 2002. 

4. E. Barkai and M.E. Hasselmo. Modulation of the input/output function of rat 
piriform cortex pyramidal cells. J. Neurophys., 72:644-658, 1994. 

5. G-q. Bi and M-m. Poo. Synaptic modifications in cultured hippocampal neurons: 
dependence on spike timing, synaptic strength, and postsynaptic cell type. J. 
Neurosci., 18:10464-10472, 1998. 




20 



Bruce P. Graham 



6. L.J. Borg-Graham. Interpretations of data and mechanisms for hippocampal pyra- 
midal cell models. In P.S. Ulinski, E.G. Jones, and A. Peters, editors, Cerebral 
Cortex, Volume 13: Cortical Models. Plenum Press, New York, 1998. 

7. G. Buzsaki. Two-stage model of memory trace formation: a role for “noisy” brain 
states. Neuroscience , 31:551-570, 1989. 

8. G. Buzsaki. Theta oscillations in the hippocampus. Neuron, 33:325-340, 2002. 

9. G. Buzsaki and J.J. Chrobak. Temporal structure in spatially organized neuronal 
ensembles: a role for interneuronal networks. Curr. Opin. Neurobiol., 5:504-510, 
1995. 

10. G.C. Castellani, E.M. Quinlan, L.N. Cooper, and H.Z. Shouval. A biophysical 
model of bidirectional synaptic plasticity: dependence on AMPA and NMDA re- 
ceptors. Proc. Nat. Acad. Sci., 98:12772-12777, 2001. 

11. S.R. Cobb, E.H. Buhl, K. Halasy, O. Paulsen, and P. Somogyi. Synchronization of 
neuronal activity in hippocampus by individual GABAergic interneurons. Nature, 
378:75-78, 1995. 

12. J-M. Fellous and C. Linster. Computational models of neuromodulation. Neural 
Comp., 10:771-805, 1998. 

13. A. Fisahn, F.G. Pike, E.H. Buhl, and O. Paulsen. Cholinergic induction of network 
oscillations at 40Hz in the hippocampus in vitro. Nature, 394:186-189, 1998. 

14. C. Foldy, I. Aradi, A. Howard, and I. Soltesz. Diversity beyond variance: modu- 
lation of firing rates and network coherence by GABAergic subpopulations. Euro. 
J. Neurosci., 19:119-130, 2003. 

15. E. Fransen and A. Lansner. A model of cortical associative memory based on a 
horizontal network of connected columns. Network, 9:235-264, 1998. 

16. T.F. Freund. Rhythm and mood in perisomatic inhibition. TINS, 26:489-495, 
2003. 

17. T.F. Freund and G. Buzsaki. Interneurons of the hippocampus. Hippocampus, 
6:347-470, 1996. 

18. J. Goldberg, K. Holthoff, and R. Yuste. A problem with Hcbb and local spikes. 
TINS, 25:433-435, 2002. 

19. B.P. Graham. Pattern recognition in a compartmental model of a CA1 pyramidal 
neuron. Network, 12:473-492, 2001. 

20. K.D. Harris, J. Csicsvari, H. Hirase, G. Dragoi, and G. Buzsaki. Organization of 
cell assemblies in the hippocampus. Nature, 424:552-556, 2003. 

21. M.E. Hasselmo. Acetylcholine and learning in a cortical associative memory. Neural 
Comp., 5:32-44, 1993. 

22. M.E. Hasselmo, B.P. Anderson, and J.M. Bower. Cholinergic modulation of cortical 
associative memory function. J. Neurophys., 67:1230-1246, 1992. 

23. M.E. Hasselmo, C. Bodelon, and B.P. Wyble. A proposed function for hippocampal 
theta rhythm: separate phases of encoding and retrieval enhance reversal of prior 
learning. Neural Comp., 14:793-817, 2002. 

24. M.E. Hasselmo and J.M. Bower. Cholinergic suppression specific to intrinsic not 
afferent fiber synapses in rat piriform (olfactory) cortex. J. Neurophys., 67:1222- 
1229, 1992. 

25. M.E. Hasselmo and B.P. Fehlau. Differences in time course of ACli and GABA 
modulation of excitatory synaptic potentials in slices of rat hippocampus. J. Neu- 
rophys., 86:1792-1802, 2001. 

26. M.E. Hasselmo, J. Hay, M. Ilyn, and A. Gorchetchnikov. Neuromodulation, theta 
rhythm and rat spatial navigation. Neural Networks, 15:689-707, 2002. 




Dynamics of Storage in Hippocampal Associative Memory Networks 



21 



27. M.E. Hasselmo and E. Schnell. Laminar selectivity of the cholinergic suppression 
of synaptic transmission in rat hippocampal region CA1: computational modeling 
and brain slice physiology. J. Neurosci., 14:3898-3914, 1994. 

28. M.E. Hasselmo, E. Schnell, and E. Barkai. Dynamics of learning and recall at ex- 
citatory recurrent synapses and cholinergic modulation in rat hippocampal region 
CA3. J. Neurosci., 15:5249-5262, 1995. 

29. D.A. Hoffman and D. Johnston. Neuromodulation of dendritic action potentials. 
J. Neurophys., 81:408-411, 1999. 

30. D.A. Hoffman, J.C. Magee, C.M. Colbert, and D. Johnston. K + channel regulation 
of signal propagation in dendrites of hippocampal pyramidal neurons. Nature, 
387:869-875, 1997. 

31. J.J. Hopfield. Neural networks and physical systems with emergent collective com- 
putational abilities. Proc. Nat. Acad. Sci., 79:2554-2558, 1982. 

32. H. Hu, V. Vervaeke, and J.F. Storm. Two forms of electrical resonance at theta 
frequencies, generated by M-current, h-current and persistent Na+ current in rat 
hippocampal pyramidal cells. J. Physiol., 545:783-805, 2002. 

33. B. Hutcheon and Y. Yarom. Resonance, oscillation and the intrinsic frequency 
preferences of neurons. TINS, 23:216-222, 2000. 

34. N. Ishizuka, W.M. Cowan, and D.G. Amaral. A quantitative analysis of the den- 
dritic organization of pyramidal cells in the rat hippocampus. J. Comp. Neurol, 
362:17-45, 1995. 

35. O. Jensen. Information transfer between rhythmically coupled networks: reading 
the hippocampal phase code. Neural Comp., 13:2743-2761, 2001. 

36. O. Jensen, M.A.P. Idiart, and J.E. Lisman. Phyiologically realistic formation of 
autoassociative memory in networks with theta/gamma oscillations: role of fast 
NMDA channels. Learning & Memory, 3:243-256, 1996. 

37. O. Jensen and J.E. Lisman. Theta/gamma networks with slow NMDA channels 
learn sequences and encode episodic memory: role of NMDA channels in recall. 
Learning & Memory, 3:264-278, 1996. 

38. D. Johnston, J.C. Magee, C.M. Colbert, and B.R. Christie. Active properties of 
neuronal dendrites. Ann. Rev. Neurosci., 19:165-186, 1996. 

39. S. Kali and P. Dayan. The involvement of recurrent connections in area CA3 in 
establishing the properties of place fields: a model. J. Neurosci., 20:7463-7477, 
2000 . 

40. U.R. Karmarkar and D.V. Buonomano. A model of spike-timing dependent plas- 
ticity: one or two coincidence detectors? J. Neurophys., 88:507-513, 2002. 

41. T. Klausberger, P.J. Magill, L.F. Marton, J.D.B. Roberts, P.M. Cobden, 
G. Buzsaki, and P. Somogyi. Brain-state- and cell-type-specific firing of hippocam- 
pal interneurons in vivo. Nature, 421:844-848, 2003. 

42. I.L. Kopysova and D. Debanne. Critical role of axonal A-type K+ channels and 
axonal geometry in the gating of action potential propagation along CA3 pyramidal 
cell axons: a simulation study. J. Neurosci., 18:7436-7451, 1998. 

43. W.B. Levy. A sequence predicting CA3 is a flexible associator that learns and uses 
context to solve hippocampal-like tasks. Hippocampus, 6:579-590, 1996. 

44. J.E. Lisman. Relating hippocampal circuitry to function: recall of memory se- 
quences by reciprocal dentate-CA3 interactions. Neuron, 22:233-242, 1999. 

45. J.E. Lisman and M.A.P. Idiart. Storage of 7 ± 2 short-term memories in oscillatory 
subcycles. Science, 267:1512-1514, 1995. 

46. G. Maccaferri and J-C. Lacaillc. Hippocampal interneuron classifications - making 
things as simple as possible, not simpler. TINS, 26:564-571, 2003. 




22 



Bruce P. Graham 



47. J.C. Magee. Dendritic hyperpolarization-activated currents modify the integrative 
properties of hippocampal CA1 pyramidal neurons. J. Neurosci., 18:7613-7624, 
1998. 

48. J.C. Magee. Dendritic R normalizes temporal summation in hippocampal CA1 
neurons. Nat. Neurosci., 2:508-514, 1999. 

49. J.C. Magee and E.P. Cook. Somatic EPSP amplitude is independent of synapse 
location in hippocampal pyramidal neurons. Nat. Neurosci., 3:895-903, 2000. 

50. J.C. Magee, D. Hoffman, C. Colbert, and D. Johnston. Electrical and calcium 
signaling in dendrites of hippocampal pyramidal neurons. Ann. Rev. Physiol, 
60:327-346, 1998. 

51. D. Marr. Simple memory: a theory for archicortex. Phil. Trans. Roy. Soc. Lond. 
B, 262:23-81, 1971. 

52. C.J. McBain and A. Fisahn. Interneurons unbound. Nat. Rev. Neurosci., 2:11-23, 

2001 . 

53. B.L. McNaughton and R.G.M. Morris. Hippocampal synaptic enhancement and 
information storage within a distributed memory system. TINS, 10:408-415, 1987. 

54. O. Melamed, W. Gerstner, W. Maass, M. Tsodyks, and H. Markram. Coding and 
learning of behavioural sequences. TINS, 27:11 14,2004. 

55. E.D. Menschik and L.H. Finkel. Neuromodulatory control of hippocampal function: 
towards a model of Alzheimer’s disease. Artif. Intel l. Med., 13:99-121, 1998. 

56. M. Migliore. Modeling the attenuation and failure of action potentials in the 
dendrites of hippocampal neurons. Biophys. J., 71:2394-2403, 1996. 

57. M. Migliore. On the integration of subthreshold inputs from perforant path and 
Schaffer collaterals in hippocampal CA1 pyramidal neurons. J. Comput. Neurosci., 
14:185-192, 2003. 

58. M. Migliore, D.A. Hoffman, J.C. Magee, and D. Johnston. Role of an A-type 
K + conductance in the back-propagation of action potentials in the dendrites of 
hippocampal pyramidal neurons. J. Comput. Neurosci., 7:5-15, 1999. 

59. M. Migliore and G.M. Shepherd. Emerging rules for the distributions of active 
dendritic conductances. Nat. Rev. Neurosci., 3:362-370, 2002. 

60. J. O’Keefe and M.L. Recce. Phase relationship between hippocampal place units 
and the EEC theta rhythm. Hippocampus, 3:317-330, 1993. 

61. G. Orban, T. Kiss, M. Lengyel, and P. Erdi. Hippocampal rhythm generation: 
gamma-related theta- frequency resonance in CA3 interneurons. Biol. Cybern., 
84:123-132, 2001. 

62. O. Paulsen and E.I. Moser. A model of hippocampal memory encoding and re- 
trieval: GABAergic control of synaptic plasticity. TINS, 21:273-279, 1998. 

63. F.G. Pike, R.S. Goddard, J.M. Suckling, P. Canter, N. Kasthuri, and O. Paulsen. 
Distinct frequency preferences of different types of rat hippocampal neurones in 
response to oscillatory input currents. J. Physiol, 529:205-213, 2000. 

64. P. Poirazi, T. Brannon, and B.W. Mel. Arithmetic of subthreshold synaptic sum- 
mation in a model CA1 pyramidal cell. Neuron, 37:977-987, 2003. 

65. P. Poirazi, T. Brannon, and B.W. Mel. Pyramidal neuron as a two-layer neural 
network. Neuron, 37:989-999, 2003. 

66. D.D. Rasmusson. The role of acetylcholine in cortical synaptic plasticity. Behav. 
Brain Res., 115:205-218, 2000. 

67. M. Remondes and E.M. Schuman. Direct cortical input modulates plasticity and 
spiking in CA1 pyramidal neurons. Nature, 416:736-740, 2002. 

68. A. Samsonovich and B.L. McNaughton. Path integration and cognitive mapping 
in a continuous attractor neural network model. J. Neurosci., 17:5900-5920, 1997. 




Dynamics of Storage in Hippocampal Associative Memory Networks 



23 



69. B. Santoro and T.Z. Baram. The multiple personalities of h-channels. TINS, 
26:550-554, 2003. 

70. A. Saudargiene, B. Porr, and F. Worgotter. How the shape of pre- and postsynaptic 
signals can influence STDP: a biophysical model. Neural Comp., 16(3), 2003. 

71. H.Z. Shouval, M.F. Bear, and L.N. Cooper. A unified model of NMDA receptor- 
dependent bidirectional synaptic plasticity. Proc. Nat. Acad. Sci., 99:10831 -10836, 
2002 . 

72. F.T. Sommer and T. Wennekers. Modelling studies on the computational function 
of fast temporal structure in cortical circuit activity. J. Physiol. (Paris), 94:473- 
488, 2000. 

73. F.T. Sommer and T. Wennekers. Associative memory in networks of spiking neu- 
rons. Neural Networks, 14:825-834, 2001. 

74. V. Sourdet and D. Debanne. The role of dendritic filtering in associative long-term 
synaptic plasticity. Learning & Memory, 6:422-447, 1999. 

75. G.J. Stuart and B. Sakmann. Active propagation of somatic action potentials into 
neocortical pyramidal cell dendrites. Nature, 367:69-72, 1994. 

76. G.J. Stuart, N. Spruston, B. Sakmann, and M. Hausser. Action potential initiation 
and backpropagation in neurons of the mammalian CNS. TINS, 20:125-131, 1997. 

77. R.D. Traub, J.G.R. Jefferys, R. Miles, M.A. Whittington, and K. Toth. A branch- 
ing dendritic model of a rodent CA3 pyramidal neurone. J. Physiol., 481:79-95, 
1994. 

78. R.D. Traub, J.G.R. Jefferys, and M.A. Whittington. Fast oscillations in cortical 
circuits. MIT Press, Cambridge, Massachusetts, 1999. 

79. R.D. Traub and R. Miles. Pyramidal cell-to-inhibitory cell spike transduction ex- 
plicable by active dendritic conductances in inhibitory cell. J. Comput. Neurosci., 
2:291-298, 1995. 

80. A. Treves and E.T. Rolls. Computational analysis of the role of the hippocampus 
in memory. Hippocampus, 4:374-391, 1994. 

81. M.V. Tsodyks. Attractor neural network models of spatial maps in hippocampus. 
Hippocampus, 9:481-489, 1999. 

82. G.V. Wallenstein and M.E. Hasselmo. GABAergic modulation of hippocampal 
population activity: sequence learning, place field development, and the phase pre- 
cession effect. J. Neurophys., 78:393-408, 1997. 

83. X-J. Wang and G. Buzsaki. Gamma oscillation by synaptic inhibition in a hip- 
pocampal interneuronal network model. J. Neurosci., 16:6402-6413, 1996. 

84. D. Willshaw and P. Dayan. Optimal plasticity from matrix memories: what goes 
up must come down. Neural Comp., 2:85-93, 1990. 

85. R. Yuste, A. Majewska, and K. Holthoff. From form to function: calcium compart- 
mentalization in dendritic spines. Nat. Neurosci., 3:653-659, 2000. 




On the Nested Hierarchical Organization of CNS: 
Basic Characteristics 
of Neuronal Molecular Networks 



Luigi Francesco Agnati 1 ’*, Letizia Santarossa 1 , Susanna Genedani 2 , 

Enric I. Canela 3 , Giuseppina Leo 1 , Rafael Franco 3 , Amina Woods 4 , 
Carmen Lluis 3 , Sergi Ferre 4 , and Kjell Fuxe 5 

1 Department of Biomedical Sciences, Section of Physiology 

luigiagnatiOtin. it 

2 Department of Biomedical Sciences, Section of Pharmacology 
University of Modena and Reggio Emilia, via Campi 287, 41100 Modena, Italy 
3 Department of Biochemistry and Molecular Biology of the University of Barcelona, 
Marti Franques 1 Barcelona E-08028, Catalonia, Spain 
4 National Institute on Drug Abuse, Department of Health and Human Services, 
Baltimore, Intramural Research Programm Maryland 21224 
5 Department of Neurosciences, Karolinska Institutet, Stockholm, Sweden 



Abstract. It has been suggested that the Central Nervous System 
(CNS) is built as a “nested system of networks of networks” structured 
according to hierarchical principles. Thus, it is possible to recognize net- 
works at multiple scales (miniaturisation principle) moving top-down 
from neuronal to molecular networks. Such a nested organization of the 
CNS leads to the problem how circulation and integration of information 
at different levels of miniaturisation (i.e., macro-scale, meso-scale and 
micro-scale) take place. We suggest that local circuits are the highly spe- 
cialised meso-scale level capable of functionally interconnecting macro- 
with micro-scale level. Furthermore, it is suggested that the same modes 
for communication (Wiring and Volume Transmission) are in operation 
at various miniaturisation levels. This indicates a sort of “fractal organi- 
zation” of the CNS. Detailed analysis of molecular networks is discussed. 
In particular, receptor-receptor interactions are examined as an impor- 
tant input to horizontal molecular networks located at plasma membrane 
level. 



1 The Concept of “Biological Complex System” 

Complex systems are often said to consist of large numbers of simple and iden- 
tical elements that interact to produce complex behaviors. Actually, complex 
biological systems are made by a large number of diverse elements interacting 
to produce coherent and rather complex behaviors [1], Complex systems may 
be best described as networks with complex topologies [2] . The concept of “net- 
work” is used in different contexts, e.g., it is used to describe complex systems 
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such as society and Internet [3] . In the biological field the brain is considered as 
a network of neurons connected by axons and the cells themselves are considered 
as networks of molecules connected by biochemical reactions [4], [5]. 

In any network it is possible to identify vertices (or nodes) and edges or chan- 
nels (connection paths or communication channels), that allow the interactions 
among nodes [4] . The formal description of complex systems as networks of inter- 
connected elements allows the development of theoretical models and computer 
simulations. Thus, as underlined by Kitano: to understand complex biological 
systems requires the integration of experimental and computational research, in 
other words, a system biology approach [1] . However, the complexity of biological 
systems will not likely be solved only by computational efforts and the popular 
notion that “biological science will turn out into information science ” overesti- 
mates the power of the computational approach and ignores the pressing need for 
high quality data for modeling and analysis [6] . 

In fact, as stated above, complex biological systems are characterized by 
the extreme heterogeneity of the parts and, in addition, they are organized in 
intricate and highly structured networks, with hierarchies and multiple scales [7], 
[8] , [9] . Actually, many challenges of post-genomic biology are converging to the 
challenges facing engineers building complex “networks of networks” [10]. The 
individualization of hierarchies and multiple scales may lead to a reductionism 
approach namely to focus on the identification of the building elements of the 
complex system and on the different levels of the system architecture. One of the 
possible fallacies of reductionism approach in biology (i.e., the fragmentation of 
a complex biological system into parts) has been pointed out by Goethe (Faust: 
Studier zimmer) : If you want to understand and describe a living being, as a first 
step you must get rid of its animating principle; then you have in your hands 
the various parts of it. Only the life bonds will be missing; this is a pity! 

In the present paper, it is suggested that to have an understanding of the basic 
features of the operations carried out by the Central Nervous System (CNS) not 
only a hypothesis on its morphological and functional organization is needed, but 
also an insight into its logics. In other words, a hypothesis should be put forward 
on its morphological and functional organization to identify the building elements 
and the overall architecture of such a complex system. Then the “various parts 
will be in our hands” . In a subsequent step an insight should be obtained on the 
logic according to which its building elements work and how their operations are 
integrated to produce the coherent activity of the CNS. Then, a hint on “the 
life bonds” that make the system work as a whole will be obtained. The search 
in the realm of logic is, therefore, of basic importance for the understanding 
of brain function. In agreement with this view, is the Bacon’s statement: I hold 
that true logic ought to enter the several provinces of science armed with a higher 
authority than belongs to the principles of those sciences themselves, and ought 
to call those putative principles to account until they are fully established [11]. 
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2 On the Nested Organization 
of the Central Nervous System 

It has been suggested that the computational devices of the CNS are organized 
according to a “Russian doll” (“nested”) hierarchic principle [7], [8], [12]. Thus, 
it has been surmised that computation can be performed at various hierarchic 
levels in the Central Nervous System (CNS). Broadly speaking these levels can 
be listed as follows: 

— System of cellular networks, made by assemblies of cellular networks working 
as an integrated computational system (e.g., the CNS). The result of the 
integration process is a multi-facet functional response of the system, i.e., a 
so-called high level “syndromic response” of the system [8] . 

— Cellular network, made by assemblies of neurons and/or glial cells capable 
of carrying out one function (or more than one function in the case of poly- 
morphic networks) in a system of cellular networks such as the CNS [13]. It 
is suggested that a term more precise than “polymorphic networks” could 
be “polyfunctional networks” since these networks under specific pervading 
influences (e.g., a highly diffusible VT signal such as CO 2 ) [14] can change 
their functional output. 

— Local circuit, made by portions of a neuron (or neurons) that, under given 
conditions, function as an independent integrative unit [15], [16] (see also 
below). A schematic illustration of a local circuit is given in the central 
panel of Fig. 1. 

— System of molecular networks, made by the assemblies of molecular net- 
works involved in carrying out a complex cellular function and work as an 
integrated computational system. Thus, also at the cellular level there is the 
production of a “syndromic response” (i.e., a multi-facet high level cellular 
response) [17]. 

— Molecular network, made by molecules that function as a metabolic and/or 
signalling pathway in a cell. The molecular network is especially involved in 
carrying out one function but often it can also operate as a polyfunctional 
network, since also these networks (as the cellular networks, but at a different 
level of miniaturization) under specific pervading influences can change their 
functional output (Fig. 1). 

— Macro-molecular complex, made by functional groups in a large molecule 
(or, e.g., protomers in a multimeric protein) that allow the macromolecular 
complex to carry out its function. 

The first two levels can be considered as “macro-scale levels”, the third is 
the “meso-scale level”, the last three ones as “micro-scale levels” (see Fig. 1). 
The impact of the elaborations at any of these levels on the overall brain func- 
tion will be dependent on the hierarchic level at which they have been carried 
out, the location of this computing device in the mass of the CNS and on the 
temporal aspects that characterize these elaborations. Thus, the understanding 
of the integrative capabilities of the CNS is linked to the evaluation of both the 
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spatial-temporal relationships among informational elements within a certain 
computational level and the spatial-temporal relationships among computations 
carried out at levels of different miniaturization. 

The suggested nested organization of the computational devices in the CNS 
leads to the problem namely how the circulation of the information among the 
different levels of miniaturization, i.e. , among the micro-scale, the meso-scale 
and the macro-scale levels takes place. In the past, we have suggested that the 
local circuits could be the highly specialized meso-scale level capable of intercon- 
necting the macro- with the micro-scale levels (see Fig. 1) and hence allowing 
the continuous and bi-directional transfer of information between the molecular 
level and the neuronal network level [16]. 

Local circuits can be defined as any portion of the neuron (or neurons, 
especially short-axon neurons (i.e., Golgi II. type neurons) that, under given 
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Fig. 1 . Schematic representation of three networks of different hierarchic levels that 
can be recognized in the morphological and functional organization of the CNS. It can 
be observed that local circuits belong to a meso-scale hierarchic level and can therefore 
operate as a “bridge” between macro-scale and micro-scale hierarchic levels. For further 
details, see text. 
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conditions, functions as an independent integrative unit [15]. This phenomenon 
depends on the fact that in local circuits membrane domains belonging to the 
same and/or different cells are so close to each other to make possible prompt and 
highly effective chemical and electrotonic interactions between molecular circuits 
present in the plasma membranes facing each other. These computational capa- 
bilities are very likely of the highest importance since local circuits may indeed 
prove to be the neural substrate of higher brain functions [18]. In agreement with 
this view Cajal reported that the functional superiority of the human brain is 
intimately linked to the prodigious abundance and unaccustomed wealth of forms 
of the so-called “neurons with short, axons'’ [15]. Local circuits are characterized 
by a high density of computational structures, the potential for complex interac- 
tions, and the rapid processing of information, owing to the tight interweaving 
of fine neuronal processes and the short intra- and inter-neural distances over 
which they operate. In addition, their use of low-level graded potentials requires 
high sensitivity, permitting discrimination that cannot be equalled by an all-or- 
none process and endows them with the capability of explaining the “analog” 
nature of mental transformations [19]. 

Thus, local circuits (see Fig. 1) could represent a privileged site where elec- 
trotonic signals could affect the molecular networks located at cell plasma mem- 
brane level [20] . Molecular networks are made by molecules (especially proteins) 
that are interconnected to form circuits capable not only of transmitting infor- 
mation but also of processing the information as it is being transmitted [21]. 
It may be of importance to distinguish metabolic networks from signal (regula- 
tory) networks, since metabolic networks are devoted to “core functions” such 
as metabolism and energy generation, while signal networks are devoted to reg- 
ulate one of metabolic networks. It should be mentioned that this subdivision 
is linked to the complexity of the biological systems, since information from 
genomic studies concerning the molecular makeup of cells, indicates that the 
proportion of components devoted to signal (regulatory) networks as opposed to 
those devoted to core functions is dominant [22] . We will focus our attention on 
signal (regulatory) molecular networks. For this type of networks it is possible 
to further distinguish (see Fig. 2): 

— Horizontal Molecular Networks (HMNs): these are molecular circuits made 
by molecules embedded and/or associated with the plasma membrane that 
carry out complex functions of basic importance for informational exchange 
between the cell and the extracellular environment (Lo level). Hence, HMNs 
represent the main target of electrotonic and chemical signals present at local 
circuit level. It should be mentioned that extra cellular chemical signals are 
usually recognized and decoded by receptors. 

— Vertical Molecular Networks (VMNs): these are molecular circuits that ex- 
tend into the cytoplasm and often reach the nucleus. They are part of the 
informational channels to the genome (Li level). 

— Kernel Molecular Networks (KMNs): these are molecular circuits localized 
inside the nucleus; thus they are devoted to the regulation of gene expression 
(L 2 level). 




On the Nested Hierarchical Organization of CNS 



29 



It is important to distinguish the molecules of the HMNs that are integral 
proteins of the plasma membrane (for example the ion channel receptors and G 
protein-coupled receptors (GPCRs)) from the molecules of the HMNs that are 
transiently associated with the plasma membrane (for example various types of 
receptors interacting proteins [23]). These could be defined as “shuttle Lo - Li 
molecules” moving back and forth between the membrane and the cytoplasm. 
There are also “shuttle C - L 2 molecules” moving back and forth between the 
cytoplasm and the nucleus. Of particular importance are two classes of proteins 
the “importins” and the “exportins” . As a matter of fact, “cargo molecules” are 
transported (thanks to the “importins” ) from the cytoplasm to the nucleus where 
they carry out their function and vice versa “cargo molecules” are transported 
(thanks to the “exportins” ) from the nucleus to the cytoplasm where they fulfill 
their function [24] . Similarly to the HMNs, which work as “intelligent” interfaces 
between extracellular and intracellular networks [17], at the nuclear membrane 
level macro- molecular complexes (nuclear pores) control the cytoplasm/nuclear 
traffic, e.g., allowing the importin-cargo protein complex to enter into the nu- 
cleus and the exportin-cargo protein complex to move from the nucleus to the 
cytoplasm. Furthermore, it should be noticed that the HMNs are “two-sided 
input/output networks”, i.e., can receive signals from the extracellular as well 
from the intracellular environment of the cell and can give an output signal to 
both the extra cellular and the intracellular environment of the cell. Similarly, 
the macro-molecular complexes at nuclear membrane level can control the sig- 
nals to be imported into the nucleus to regulate gene expression and signals to 
be exported to participate in the cytoplasmatic biochemical machinery. 

A general scheme for the environment /cell cross-talk is as follows: extracellu- 
lar signals activate specific HMNs. One of the main effects of this activation is to 
cause proteins involved in gene control, resident in the cytoplasm, to reach upon 
phosphorylation the nuclear membrane macro-molecular complexes regulating 
the cytoplasm/nuclear traffic and to migrate into the nucleus where they as tran- 
scription factors, cause changes in the activity of genes. The cellular response 
can be confined within the cell or it can reach the extracellular environment, as 
is the case upon release of a chemical signal. Thus, in the cell signalling pro- 
cess at least three major compartments should be considered: the extracellular 
environment, the cytoplasm, the nucleus and two “intelligent interfaces” at the 
boundaries of the compartments, namely the HMNs at plasma membrane level 
and the macro-molecular transport complexes at nuclear membrane level (see 
Fig. 2). 



3 The Concept of “Fractal Logic” of the Nested 
Organization of the Central Nervous System 

If we accept the view of the CNS as a nested hierarchical complex system, it is 
possible to search for schemes of functional organization at the various miniatur- 
ization levels. It is suggested that basically the same schemes for communication 
and elaboration of the information are in operation at the various miniaturiza- 
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SCHEMATIC REPRESENTATION OF A MOLECULAR SIGNAL NETWORK 



INPUTS 

IT 




Fig. 2. Schematic representation of different types of molecular networks that can be 
recognized in the morphological and functional organization of the cell. For further 
details, see text. 



tion levels [12], [17], [21]. This functional organization suggests a sort of “fractal 
structure” of the CNS. As matter of fact, according to fractal geometry, fractal 
objects have the property that as we magnify them, more details appear but 
the shape of any magnified detail is basically the same as the shape of the orig- 
inal object. It is, therefore, suggested to introduce the term “fractal logic” to 
describe networks of networks where at the various levels of nested organization 
the same principles (logic) to perform operations are used. Thus, if this is true 
not only the discovery of the operation principles at any nested level sheds light 
on the way of operation of the other levels, but also it is possible to have some 
understanding of the operation of the entire system from the knowledge of the 
operation principles carried out at any nested level. We will use this conceptual 
frame starting from the best-studied nested level: the neuronal networks, where 
already Sherrington’s studies allowed the assessment of some basic principles 
of operation. In particular, the concept of “fringe” should be mentioned (see 
Fig. 3). 
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Fig. 3. Schematic representation of different types of the morphological and functional 
organization of the neural networks according to the classical work of Sherrington. In 
particular the concept of “fringe” is illustrated. For further details, see text. 



A fringe is an area of overlap between two networks and it leads to facilitation 
or occlusion. In Fig. 3 if B has a low discharge rate Y does not fire, but if A or C 
are active Y excitability has been increased and it is easier for activity of neuron 
B to fire Y. Y is said to be in the subliminal facilitation fringe of A and C. If A 
fires at high rate X and Y will be activated; if C fires at high rate Y and Z will 
be activated. If A and C fired simultaneously at high rate, three neurons (X, Y, 
and Z) will discharge. Thus, the simultaneous stimulation of A and C will give 
a number of activated neurons (3) smaller than the number (2) of the activated 
neurons due to stimulation of A and the number (2) of the activated neurons 
due to stimulation of C. 

The concept of fringe has not been yet introduced in the study molecular 
networks, while other classical operation principles such as feed-back inhibition 
or feed-forward inhibition, spatial or temporal summation have been already 
largely used in molecular networks and will be considered in the present paper 
only if needed. The concept of fringe is of particular importance since it can 
be a theoretical tool to characterize the cross-talk between molecular networks 
and to understand the production of complex cellular responses (“syndromic 
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responses” at cellular level of miniaturization). Furthermore, it is sufficient to 
have control signals impinging on the subliminal fringe to regulate the cross-talk 
process and hence to modulate the syndromic response [8]. Finally, it should be 
noted that the fringe could also cover the plasma membrane in the case that two 
signal pathways have some proteins in common at the plasma membrane level, 
or directly interact like in receptor-receptor interactions forming heterodimers 

[17]- 

As mentioned above, integration of elaborations carried out at different hi- 
erarchic levels as well as within one and the same hierarchic level is achieved 
by transmission of electrical and chemical signals. On the basis of the fractal 
organization of the CNS, it is possible to suggest that also the communication 
modes have similarities. At the neural network level it has been suggested that 
signals can be transmitted according to two modalities of communication: 

1. Volume Transmission (VT): is “one to many” communication. The signal re- 
leased from the source into a medium (extracellular fluid) reaches its targets 
either along preferential pathways or in a three-dimensional fashion. 

2. Wiring Transmission (WT): is a “point to point” communication. The signal 
released from the source reaches its target following “wired pathways” (e.g., 
axons and synaptic contacts) [13], [25], [26]. 

The same concept has been applied to the molecular networks, where it has been 
suggested that [17]: 

1. VT is the diffusion of signals, especially (but not only) small molecules and 
ions (e.g., Ca 2+ ), within the intracellular environment. It should be noticed 
that molecular diffusion in the cytoplasm of small solutes such as metabo- 
lites, second messengers and nucleotides is rather high, namely only about 
four times slower than in water [27]. 

2. WT is the communication process within the intracellular environment usu- 
ally occurring along protein assemblies (see below) or thanks to microtubule- 
associated motor proteins such as kinesins and dyneins. These proteins, by 
using the energy of ATP hydrolysis, “walk” along microtubules moving vesi- 
cles and organelles from the centre of the cell towards the periphery. 

In order to speak of molecular signal networks it is necessary as a first step to 
describe their structural aspects and then define what is meant with the terms: 
“node” and “channel”. 

4 Focus on Molecular Networks 

4.1 Structural Aspects 

Many proteins involved in molecular networks are constructed from a number 
of different small domains with distinct binding specificities. The molecule that 
is bound (ligand) can be as small as the oxygen molecule, or as large as another 
protein or as large as a DNA sequence (e.g., the TATA box). Thus, specific 
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recognition of other molecules is central to protein function and each protein 
can recognize (bind) more than one ligand [28] . This makes possible the assem- 
bly/disassembly of the proteins into larger or smaller aggregates with different 
functional “emergent” properties. A class of proteins is of a great importance 
for the process of assembling/disassembling of macro-molecular complexes, the 
so-called “scaffold proteins” . Scaffold proteins bind other proteins keeping them 
in place or they can simply tether pathway components of a signalling cascade 
increasing their likelihood of acting on one another. In some other instances, 
scaffold proteins can play a catalytic role orienting and/or allosterically regu- 
lating pathway kinases. Thus, they operate as important allosteric regulators 
and act as switchers or amplifiers in a molecular network in this way control- 
ling the flow of information [29], [30]. It is, therefore, important to distinguish 
“diverter scaffolds” i.e. , organizing centres that control the flow of information 
in signalling pathways. These are “integrative nodes” participating in multiple 
pathways and therefore may have a role in fringes [17]. 

Thus, scaffold proteins enlighten one of the most peculiar properties of pro- 
teins, namely the possibility of interacting with each other to form macro- 
molecular complexes. As a matter of fact, different classes of molecules (proteins, 
lipids and glycosides) can be functionally interconnected at the membrane level 
and they can form aggregates that may work as a molecular network [31], [32]. 
However, especially proteins form such complexes since they are endowed with 
the so called “Lego property” i.e. the capability of a steric fitting with other 
proteins to form high molecular weight complexes that are capable of emergent 
functions [33] . 

The extraordinary potentiality of proteins for such interactions should be 
noticed since even if it is true that proteins usually have only two to three 
interacting domains, some human proteins contain up to 130 domains [2]. Hence, 
proteins can very effectively work as “modules” capable of building up supra- 
molecular complexes [17], [31], [34], [35], [36]. This can be used for a variety 
of molecular events, for instance, to “hide” functional groups of a protein or 
to have emergent complex functions performed by protein aggregates. On the 
other hand, protein-protein association is, inter alia , strongly regulated by a very 
common class of intracellular chemical reactions: protein phosphorylation [37], 
[38] and, thanks to this chemical reaction, alterations in the protein conformation 
occur and hence in the protein function. In particular, alterations in the protein 
conformation can favour the formation of different supra-molecular complexes 
from the same building blocks, i.e., from the same proteins. 

As shown in figure Fig. 4 we have differentiated the “Lego property” from the 
“Allosteric control” of protein functional characteristics. The allosteric control 
refers to the existence on a protein (classically an enzyme or, more generally, a 
“protein operator”) of one or more secondary binding sites that can bind regu- 
lators. The binding of a regulator to the protein can enhance (positive allosteric 
regulator) or reduce (negative allosteric regulator) the affinity of the primary 
binding of the “protein operator” site for its ligand. However, it should be no- 
ticed that the allosteric modulation of proteins can be of paramount importance 
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Fig. 4. Schematic representation of the “Allosteric property” of proteins (e.g., enzymes) 
that allows the switching on/off of its catalytic action. In the lower panel the Lego 
property of proteins (e.g., scaffold proteins) is illustrated. The “Lego property” allows 
the building of macro-molecular complexes endowed with emergent properties. It should 
be noticed that it is possible that a GPCR upon the binding with its own ligand can 
undergo the conformational change necessary for the Lego property to develop and 
thus to trigger out the assembly of the macro-molecular complex. 

to allow the fitting process between two protein modules. Thus, allosteric modu- 
lation can represent a preliminary necessary step for the appearance of the Lego 
property and of protein complexes endowed with emergent characteristics. In this 
context it is possible to mention the fact that some protein-protein interactions 
are formed only among receptors that have undergone a conformational modi- 
fication, e.g., after the phosphorylation-dephosphorylation process. The binding 
of a ligand to its membrane receptor can also trigger out the scaffolding process 
(Fig. 4). 
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The Lego property has a great importance for GPCRs, since these receptors 
can take part in protein-protein interactions in the plane of the membrane that 
allow the construction of supra-molecular complexes with distinct functional 
properties. In particular, interactions among GPCRs have been named “receptor- 
receptor interactions” (RRI) [17], [39], [40]. It has been suggested that RRI can 
lead to the formation of “receptor mosaics”, i.e. , of clusters of receptors working 
as functional units and, at least for while, as a stabile macromolecular complex 
that can underlay learning and memory [33], [41]. Receptor mosaics represent 
not simple inputs to the VMNs but rather functionally relevant parts of HMNs 
capable of integrating different signals already at the plasma membrane level 
(Fig. 5). It is possible to summarize some features of a receptor mosaic as follows: 

— A receptor cluster works as a receptor mosaic if and only if the cluster of R1 
- R2. . .Rn is such that any receptor modulates the biochemical/functional 
features of at least another receptor of the cluster, i.e. if and only if receptor- 
receptor interactions operate within the cluster of molecules. 

— The fluctuation of each receptor (of the receptor mosaic) among its possible 
conformational states is conditioned by the conformations of the other re- 




UUOd> MOLECULES OF THE HORIZONTAL MOLECULAR NETWORK 
( (, ( ) RECEPTOR MOSAIC 

4 4 INPUTS AND OUTPUTS OF THE MOLECULAR HORIZONTAL NETWORK 
-«-► PROTEIN-PROTEIN DIRECT INTERACTIONS ( "LEGO PROPERTY" ) 



Fig. 5. Schematic representation of the Horizontal Molecular Network (HMN). It 
should be noticed that clusters of receptors, via receptor-receptor interactions, can 
be part of HMNs. For further details, see text. 
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ceptors in the mosaic (receptor-receptor interactions). Hence, each receptor 
will respond to its ligand in a way that depends on its conformation and thus 
on its interactions with the other receptors of the macromolecular complex. 

— Long-range interactions mediated by other membrane components may take 
place [42], [43], 

RRI can take place via oligomerization. This term is used in molecular bi- 
ology to indicate any multimeric protein that contains a finite, relatively small, 
number of identical subunits (i.e., protomers) that are not in a covalent linkage 
but in a state of reversible association with each other [44]. The formation of 
oligomers can take place thanks to hydrophobic interactions, hydrogen bonds, 
and electrostatic interactions. Oligomerization can occur via interactions be- 
tween extra cellular and/or intramembrane and/or intracellular domains of the 
two GPCRs [14]. Oligomerization is only one mechanism for receptor-receptor 
interaction since GPCR can also directly interact with ion channel receptors 
(GABAA receptor, NMDA receptors, [45]). 

We suggest to distinguish at the membrane level (thus when the GPCRs are 
already inserted in the plasma membrane) three types of RRIs (see Fig. 6): 

1. Oligomerization of GPCRs occurs via a direct interaction. Thus, the oligo- 
merization process has taken place without the intervention of other proteins. 
From a functional point of view this type of RRI is a non-mediated inter- 
action and the macro- molecular complex is formed only by GPCRs (Type 1 
RRI). 

2. Oligomerization process of GPCRs has been made possible by adapter (or 
scaffold) proteins that bind the GPCRs and keep them in close proximity 
to each other. From a functional point of view this type of RRI is a non- 
mediated interaction. However, other molecules besides GPCRs form the 
macro- molecular complex (Type 2 RRI). 

3. The adapter-linkage of GPCRs, where adapter (or scaffold) proteins mediate 
the interaction between GPCRs. Thus, the receptors work as a functional 
unit thanks to the linkage provided by the adapter (scaffold) proteins that 
via allosteric interactions transmit the activity state of one GPCR to the 
other one. From a functional point of view this RRI is a mediated interaction 
and other molecules besides the GPCRs form the macro-molecular complex 
(Type 3 RRI). 

Since receptor mosaics are connected with effector proteins (e.g. enzymes 
and ion channels) and work as an integrative unit in HMNs (see Fig. 5) it has 
been proposed that receptor mosaics at synaptic level can affect the efficacy of 
the synaptic transmission (“synaptic weight”) and therefore may have a role in 
learning and memory processes [21], [33], [41]. 

GPCRs are organized into functional units within the HMNs. More than 50 
GPCR.-associated proteins are involved in building these units. These protein 
networks participate in the trafficking, targeting, signalling, fine-tuning, and 
allosteric regulation of GPCRs. Proteins interacting with the C-terminus of the 
GPCRs are the most numerous [46]. 
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Fig. 6. Schematic representation of the oligomerization process among proteins at 
plasma membrane level. According to the proteins involved in this process different 
types of receptor-receptor interactions can be distinguished. For further details, see 
text. 

In this context one should mention the receptor activity modifying proteins 
(RAMPs). It has been shown that the interaction of the same GPCR with differ- 
ent members of RAMPs determines its recognition and decoding properties [47]. 
Different classes of RAMPs could exist with high affinity for distinct receptor 
subtypes. 

While the extracellular domain has a predominant role of RAMPs in defining 
the properties of the majority of Class II peptide GPCRs the transmembrane 
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domain has a role in the RAMP-receptor interaction [48]. Thus, it may be sur- 
mised that RAMPs may, in some instances, work as “bridges” between two Class 
II peptide GPCRs. 

The dopamine-receptor-interacting proteins (DRIPs) could represent a spe- 
cial class of RAMPs. As a matter of fact, it has been demonstrated that a cohort 
of cytoskeletal, adapter and signalling proteins (the DRIPs) substantially reg- 
ulate dopaminergic transmission. In particular, the discovery that two DRIPs, 
calcyon and neuronal Car + sensor 1 (NCS-1), are upregulated in schizophrenia 
highlights the possibility that altered protein interactions and defects in Ca 2+ 
homeostasis might contribute to abnormalities in dopamine neurotransmission 
and hence may have a role in neuropsychiatric disorders [23]. Dopamine D1 
receptors via Gs stimulate accumulation of cAMP. However, D1 receptor stim- 
ulation can also increase the intracellular Ca 2+ level in the presence of calcyon 
(a 24kDa single TM protein that interacts with a 16-amino-acid sequence of the 
C-terminal of D1 receptor), which increases the affinity of the D1 receptor for 
Gq/11. 

The mechanism of action of NCS-1 can be described in the frame of a molec- 
ular network around the D2 receptor. The main functional role of this molecular 
network is of allowing the NCS-1 inhibition of the GRK2-dependent desensitisa- 
tion of the D2 receptor in a Ca 2+ -dependent manner. It should be noticed that 
GRK-mediated receptor phosphorylation promotes the binding of beta-arrestins, 
which not only uncouple D2 receptor (and other GPCRs from G-proteins: de- 
sensitization process) but also target the receptor for internalization in clathrin- 
coated vesicles [49]. The network involved is schematically represented in Fig. 7. 

It may be speculated that some of the so called “orphan G-protein coupled 
receptors” (oGPCRs) [50] are “true orphan receptors”, i.e., they do not have 
an extra cellular ligand binding pocket but they may function as modulators of 
other GPCRs and/or can be activated by GPCRs with which they interact in 
the HMN. Hence, some oGPCRs can even have the functional role to activate a 
cytoplasmatic cascade when they interact with other GPCRs in the HMN (see 
Fig. 8). Thus, this GPCR-oGPCR putative receptor-receptor interactions could 
be an example of RAMP-like protein-receptor interactions. 



4.2 Basic Definitions 

Nodes and channels can be identified in a molecular network and in general 
terms nodes can be thought as structures capable of chemical operations, while 
channels are molecules and/or processes that connect nodes. 

Let us give some tentative definitions: 

— Nodes (N) are proteins that by interacting with other molecules undergo a 
conformational change and/or induce a change in the interacting molecule. 
These changes have a functional (informational) meaning. It may be sug- 
gested that receptors, protein kinases, scaffold proteins (when they act as 
allosteric modulators, see above), cytoskeleton proteins (when they act as al- 
losteric modulators, see below), RAMPs and transcription factors are nodes 
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Fig. 7. Schematic representation of the mechanism of action of the neuronal Ca 2+ 
sensor 1 (NCS-1). This mechanism can be described in the frame of a the creation of a 
molecular network around the D2 receptor. The main functional role of this molecular 
network is that of allowing the NCS-1 inhibition of the GRK2-dependent desensitisation 
of D2 in a Ca 2+ dependent manner. 

of molecular networks. A few Ns are highly connected nodes (i.e., they can 
interact with a multiplicity of signals). Thus, they function as “hubs” [1] or 
as “crucial Ns” [33], 

— Channels (C) or edges are transmission lines among Ns. They can be active 
or passive (i.e., the migration of the signal along the channel can be due to 
an active transport or it can occur thanks to pre-existing energy gradients). 
Furthermore, it is suggested to distinguish between “Turing controlled chan- 
nels” and “non-Turing controlled channels” . In the first case the channel is 
controlled by a “modifier”, while in the second case it is not [51], [13]. 

— Anchoring and Tethering elements (ATE) are proteins such as scaffold and 
cytoskeleton proteins that keep in the proper place Ns (therefore it is possible 
to distinguish mobile from sessile nodes). 

— Modules (M) are assemblies of Ns especially devoted to carry out a specific 
function. 

— Molecular Networks (MN) are assemblies of Ms, single Ns and Cs. 

— A signal is a coded information transmitted along a C. It is possible to 
distinguish: 
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THE oGPCR is INVOLVED IN 
REC-REC INTERACTIONS WITH 
OTHER GPCRs, THE oGPCR IS 
NOT CAPABLE OF ACTIVATING 
G -PROTEINS, HENCE THE 
oGPCR OPERATES ONLY AS AN 
ALLOSTERIC MODULATOR OF 
THE GPCR WITH WHICH IT 
INTERACTS. 



THE oGPCR IS INVOLVED IN 
REC-REC INTERACTIONS WITH 
OTHER GPCRs, THE oGPCR IS 
ALSO CAPABLE OF ACTIVATING 
G-PROTEINS, HENCE THE 
oGPCR OPERATES BOTH AS AN 
ALLOSTERIC MODULATOR OF 
THE GPCR WITH WHICH IT 
INTERACTS AS WELL AS AN 
ACTIVATOR OF G-PROTEINS. 



POSSIBLE FUNCTIONAL MEANING OF “REAL” ORPHAN G -PROTEIN COUPLED RECEPTORS (oGPCRs) 



TRANSMITTER 
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Fig. 8. Schematic representation of one possible functional meaning of the so-called 
orphan G-Protein Coupled Receptors (oGPCRs), i.e., of GPCRs devoid of their own 
ligand that can interact with GPCR with or without the potential to bind and activate 
G proteins themselves. For further details, see text. 



1. VT-signals. These are diffusible signals (usually small molecules) that 
move along Cs. 

2. WT-signals. These are signals moving along a “wire”, i.e., a set of Ns 
assembled into a M. These signals move from one protein to the inter- 
acting ones and they appear as a transient change in the conformation 
of the down stream protein. 

Summing up, the modular nature of the signalling pathways and the process 
of protein-protein interactions [35], [40], [52], [53], [54] lead to the formation of 
molecular circuits that can be traced from the plasma membrane compartment, 
through the cytoplasm, towards the nuclear compartment (see Fig. 2 and Fig. 5). 
The Lego property of proteins is of paramount importance for most signalling 
pathways. For example, the Ras/Raf/MEK/MAPK pathway has only proteins 
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as components with no small molecular weight second messengers. Hence, the 
assembly of proteins makes the backbone of this “wired signalling pathway” and 
the message transmission consists of subsequent conformational changes of these 
proteins. 



4.3 Communication Aspects 

As stated above, WT-communications occur via the channeling of the informa- 
tion along some sort of physical delimited pathway (the information channel). In 
the case of molecular networks WT occurs along a set of proteins interconnected 
with each other (WT1, see Fig. 9) or the chemical signal jumps along active 
sites present in proteins held in situ by a scaffold protein (WT2, see Fig. 9; 
[29]). It can be surmised that while VT is very likely important both for com- 
munication between different compartments and within compartments, WT is 
mainly used for communication inside a compartment. However, WT can also 
mediate transmission across the interface between two compartments as is the 
case of GPCRs that transmit the signal caused by the binding of the ligand to 
the G-protein subunits. It should also be noticed that to get WT, besides e.g. 
molecular scaffolds, the cell could use other specific non-leaking routes such as 
the directional transport of signalling proteins via filament-bound motor pro- 
teins or via vesicular transport (see also above: Section 3). It is not always easy 
to distinguish between WT and VT in the molecular circuits. For example, it 
has been mentioned that WT depends on a process of “assembly /disassembly” 
of the “molecular wire” and the intracellular diffusion (and/or transport) of 
proteins is at the bottom of this process. Thus, both VT signalling and WT as- 
sembly depends on diffusion of molecules probably along preferential pathways. 
We suggest distinguishing two types of intracellular VT: 

1. VT-signalling: diffusion of chemical signals (usually low molecular weight 
molecules) that are recognized and decoded by a macromolecule. 

2. VT-dependent wire assembly: diffusion of usually high molecular weight 
molecules that interact with other large molecules to produce a macro- 
molecular complex, i.e., a “molecular wire” or a “node”. This aspect can 
be of importance when studying the functional role of the “shuttle Lo - Li 
molecules” and of the “shuttle Li - L 2 molecules” since these molecules can 
be VT signals (when they are small molecules that are decoded by macro- 
molecules, see above) or be element used to build up a wire for WT. 

For sake of clarity, a crude classification of some of these aspects is presented 
in Fig. 9 where the distinction is also made between the “VT-dependent wire- 
assembly” and the “tether-dependent wire-assembly”. 



4.4 Compartmentalization of the Signalling Pathways 

A basic principle of the organization of the CNS is the compartmentalization of 
the signalling pathways that can be recognized at any hierarchic level and hence 
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FUNCTIONAL MEANINGS OF MOLECULE DIFFUSION IN THE CYTOPLASM IN 
RELATION TO VOLUME AND VMRING TRANSMISSION 




VT SIGNALING: 

DIFFUSION OF ACHBdlCAL S IGNAL 
POSSIBLY ALONG PREFERENTIAL 
PATHWAYS TO ACTIVATE OR INHIBIT A 
NODE. 



SLOWWIRE- ASSEMBLING: 

0 IF FU S 10 N 0 F H IG H M 0 LEC ULAR WEIG HT MO L ECU LES P 0 S S IB L Y ALO NG P REF ERENTI AL PATHWAYS . 




FAST WIRE- ASSEMBLING: 

THE SCAFFOLD PROTEIN TETHERS OTHER PROTEINS KEEPING THBul CLOSE TO EACH OTHER. THIS 
TETHER ACTION INCREASES THEIR LIKELIHOOD TO INTERACT TO ASSBrtBLE AWIRE. 




Fig. 9. Schematic representation of the Volume Transmission (VT) within the cell 
(upper panel) and of the processes that allow the assembly of “wires” for the Wiring 
Transmission (WT). It should be noticed that it may be relevant to distinguish two 
types of WT, namely WT1 and WT2 according to the characteristics of the transmis- 
sion of the signal along the “wire”. For further details, see text. 

also at molecular level. The compartmentalization allows restricted environments 
particularly suited for the spatial separation of signal pathways. Hence, it allows 
the employment of the same signal for different aims. In other words it makes it 
possible to use “open code” signals [55] for “private” communications. It allows 
also the occurrence of a chemical reaction in a suitable and highly controlled 
chemical environment where often the molecules that have to interact are in the 
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optimal conformation, vicinity and concentration. An important case of compart- 
mentalization is the presence of a specialized location for signal inputs to the 
cell. Besides the well known case of synaptic input localization (e.g., the molec- 
ular organization of the post-synaptic density), it should also be mentioned that 
very often other inputs are localized in specialized micro-domains of the plasma 
membrane such as the lipid rafts and/or the caveolae [8], [56]. Another impor- 
tant case is the confinement of a part of a complex molecular network within 
a cell organelle, i.e. , in isolation with respect to the cytoplasmic environment. 
Two important examples of compartmentalization will be considered: 

1. The first one is the compartmentalization of two VT signals produced by the 
same node (in the present case an enzyme) at the membrane input level. 

2. The second one is the demonstration that the activation of the same node 
at membrane level (a receptor) produces different physiological and patho- 
logical effects according to the membrane domain where it is located. 

In order to illustrate the first case the transduction mechanism of the GPCR 
working via the plrosphatidylinositol cascade will be mentioned. As a matter 
of fact, the binding of a ligand (e.g., AClr) to its membrane receptors (in the 
present case, muscarinic receptors) leads to activation of a specific G protein 
called Gq that activates phospholipase C, thereby generating both inositol- 
(l,4,5)triplrosphate (InsP3) and cliacylglycerol (DAG). The InsP3 is water-solu- 
ble and quickly diffuses through the cytosol binding to a calcium ion channel in 
the ER releasing Ca 2+ ions into the cytosol. The DAG generated by phospholi- 
pase C is hydrophobic and thus remains within the membrane, where it activates 
the protein kinase C. This enzyme can plrosplrorylate specific serine and threo- 
nine groups on target proteins. It should be mentioned that DAG moves rapidly 
in the membrane since lipids can move laterally within the membranes several 
micrometers per second [57]. Thus, the generation of two VT-signals is achieved, 
one diffusing in a roughly isotropic three-dimensional fashion in the cytosol, the 
other one diffusing in a roughly two-dimensional fashion in the plane of the 
membrane. Hence, even if the two VT-signals are produced by the same decod- 
ing mechanism, they diffuse in different compartments. This datum can also be 
viewed as an example of the more general phenomenon mentioned above, namely 
that the lipid environment of the membrane introduces a two-dimensional reac- 
tion environment, with enhancement of effective concentrations, and component 
orientation relative to the membrane [29] . 

The second case will be illustrated by analysing the different effects on cell 
survival of the activation of synaptic NMDA receptors versus the activation of 
extra-synaptic NMDA receptors [58]. The NMDA receptors are permeable to 
both sodium ions (phenomenon that contributes to the post-synaptic depolar- 
ization) and calcium ions (phenomenon that generates the intracellular calcium 
ion transients, and hence activates intracellular molecular networks). NMDA 
receptors have both synaptic and extra-synaptic localizations. Extra-synaptic 
NMDA receptors have an ill-defined physiological role, while the basic role of 
synaptic NMDA receptors for many of the activity-dependent changes in synap- 
tic weight and, hence, for learning and memory processes is well established. 
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More controversial is the role of NMDA receptors on neuronal survival, since on 
one hand there is evidence that physiological levels of NMDA activation promote 
neuron survival also in challenging conditions (e.g., after mechanical trauma), 
while on the other hand a large body of evidence demonstrates that abnormally 
high levels of glutamate and therefore of NMDA activation (e.g., after stroke) 
cause cell death. It has been suggested that not only is there an inverted 11- 
shaped dose response curve for the glutamate effects on cell survival (too low or 
too high levels are responsible for the pro-death effects of glutamate) but also 
that a different signalling process occurs when the synaptic NMDA receptors 
versus the extra-synaptic NMDA receptors are activated [58] . 

Thus, it has been shown that while synaptic NMDA activation induces the 
Ca 2+ activation of CREB (cAMP-response-element-binding-protein) that has 
an important role in synaptic plasticity and learning and memory processes [59] , 
the Ca 2+ entry via extra-synaptic NMDA receptors is coupled to a dominant 
CREB “shut-off” pathway that causes CREB deplrosplrorylation of its activa- 
tor site (Ser 133). Since CREB controls several pro-survival genes, this shut-off 
might contribute to the apoptotic process observed after pathological NMDA 
receptor activation. Furthermore, it has been suggested that the Ca 2+ entry via 
the extra-synaptic NMDA receptors potently contribute to the Ca 2+ loading and 
depolarization of mitochondria and hence to activation of cell death pathways 
(see Fig. 10). This difference between the physiological and pathological action of 
the synaptic versus extra-synaptic NMDA receptors may depend on differences 
in the molecular networks to which the NMDA receptors, especially with the 
C-termini of their subunits, are coupled. This is a very interesting speculation 
of Hardingham and Bading [58] based on some experimental evidence and gives 
a rational to the demonstrated pro-survival action of synaptic NMDA receptors 
versus the pro-death action of the extra-synaptic NMDA. However, one may 
wonder of the functional meaning of the surprising experimental evidence that 
the same transmitter (glutamate) via the same type of receptors (NMDA) acti- 
vate or inhibit CREB, dependent on the membrane domain where the NMDA 
receptors are localized. In our opinion this may be a mechanism that allows not 
only the specification of the spatial localization of the activated NMDA input, 
but also can help in creating a temporal code of CREB activation related to the 
temporal pattern of synaptic inputs via NMDA receptors. Thus, the temporal 
code could be achieved by the integration of VT-signals (glutamate on extra- 
synaptic NMDA receptors) with WT-signals (glutamate on synaptic NMDA re- 
ceptors). As a matter of fact, it may be surmised that when the extra-synaptic 
NMDA receptors are activated a high threshold of Ca 2+ concentrations is set for 
CREB activation that becomes turned off as the extrasynaptic NMDA receptors 
becomes activated when the glutamate spillover reach them. 

4.5 Efficacy of the Connectivity 

Aspects of the communication process that are strictly linked to the interactions 
between computing elements within a network and cross-talk between networks 
are the efficacy of the connection and the perviousness of the channel. Efficacy 
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EFFECTS OF THE ACTIVATION OF SYNAPTIC versus EXTRA-SYNAPTIC 
NMD A RECEPTORS 




Fig. 10. Schematic representation of the different effects on cell signalling and cell 
survival of the activation of the NMDA receptors according to their localization in 
different membrane domains (synaptic domain versus extra-synaptic domains). The 
hypothesis is put forward that a possible functional meaning of these markedly different 
effects is to allow the creation of a temporal code in CREB activation related to the 
pre-synaptic firing rate of glutamatergic terminals. For further details, see text. 



of the connection has been classically thought of as a change of the “synap- 
tic weight” of the connection between the computing elements within a neural 
network. The perviousness of the channel between computing elements has been 
related to the presence and action of a “modifier” localised along the channel and 
capable of affecting the incoming signal during its transmission [13]. A change 
of the “synaptic weight” has been proven as an important mechanism for plas- 
tic adaptations of a neural network and as such it has been related to learning 
processes and it has been applied for the training of artificial neuronal networks. 
However, a change in the perviousness of the communication channels between 
computing elements can also be seen as a plastic feature of the brain and actions 
on the communication capabilities of channels in a network can be related to 
some kind of learning process. This view is based on the pioneering work of Alan 
Turing that proposed the so-called Turing’s B-type unorganised machine that 
is a device that via changes in the perviousness of the communication channels 
connecting its computational elements could learn a task [13], [51]. 
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While changes in the synaptic weights are clearly a WT-based learning pro- 
cess, changes in the perviousness of communication channels can more easily 
take place in the VT-communication mode. Thus, we have proposed that not 
only a WT mode, but also a VT mode can be at the basis of learning processes. 
Obviously, most of the times, both modes are very likely used by neural networks 
in the learning process. 

In agreement with our proposal of a fractal organization of the CNS, we 
propose that Turing’s B-type unorganized machine can be of importance also 
for molecular networks. It can be surmised that if the principles of this type of 
machine work in the fringes between molecular networks it is possible for single 
molecular networks to be organized in complex systems via the control of the 
modifiers localized along the connecting channels (Fig. 11). 

Hence, it is possible not only for the biochemical machinery of the cell to 
perform complex operations, but also learning, i.e. , store information [33]. If this 
functional architecture of molecular networks is reasonable, it could be important 
to investigate, not only if some form of learning and engram storage could occur 
at the level of molecular networks, but if some particular type of learning takes 
place at the level of HMNs, VMNs and KMNs, respectively. 




Fig. 11. Schematic representation of fringes between molecular networks. It can be sur- 
mised that if the principles of a Turing type-B unorganised machine work in the fringes 
(i.e., in the channels operating as bridges between molecular networks), it is possible 
to have modules (the single molecular networks) that can be organized in complex sys- 
tems (see text) via the control of the “modifiers” localized along these bridges. Hence, 
it is possible not only for the biochemical machinery of the cell to perform complex 
operations, but also learning, i.e., store information. For further details, see text and 
Fig. 3. 





On the Nested Hierarchical Organization of CNS 



47 



5 Focus on Molecular Networks: 

How Elaborations Are Carried Out 

The above-mentioned operation principles can be used to describe some char- 
acteristics of the functional organization of molecular networks. Of particular 
importance are the concept of fringe and the related concept of channels regu- 
lated by a modifier since they allow dealing with the aspects of the integrative 
actions of molecular signal networks. 

Some qualitative/quantitative parameters could be used to characterize the 
fringes (Fig. 11), such as the size of the area of overlap expressed as the number 
of nodes and channels within it and the number of modifier-regulated channels. 
It could also be interesting to known if crucial nodes (“hubs”) belong to it. 

Biological networks show many important functional characteristics such as: 
robustness, redundancy, structural stability, modular design, integral feedbacks, 
plasticity. Let us define and briefly comment on two of these characteristics, 
namely redundancy and plasticity. As Edelman underlines, the concept of re- 
dundancy should be distinguished from the concept of “degeneracy” [60] . 

Redundancy is achieved by many autonomous units, which carry out identical 
roles. Degeneracy is the ability of elements that are structurally different to 
perform the same function or to yield the same output. This property has a great 
impact also in signalling pathways since components of these networks commonly 
bind to and affect the functional properties of more than one downstream target 
producing branched networks (necessary condition to have “fringes” ) and causing 
similar and/or different biological responses. The result of degeneracy in the 
latter case is that of producing what we have called “syndromic responses” at 
the level of cellular or molecular networks (see above) . 

Plasticity is the capability of a system (e.g., a neuronal or a molecular net- 
work) to change its functional meaning and, hence, it is related to degeneracy. 
One of the best examples of degeneracy at neuronal network level is “synaptic 
plasticity” , where a huge number of degenerate mechanisms can similarly affect 
the synaptic efficacy (i.e., the “synaptic weight”), and therefore they can lead 
to the same functional change in the synaptic transmission [60] . 

6 General Discussion 

On the basis of these concepts a network analysis of the nested hierarchic or- 
ganization of the CNS could be carried out that should be supported and/or 
modified by both computer simulations and neurobiological experimental obser- 
vations. This is obviously a large scope project. The more restricted aim of the 
present paper is that of analysing some basic characteristics of molecular signal 
networks. A discussion on the some general features of signalling networks can be 
found in the work of Bhalla and Iyengar [61]. In their work biochemical signalling 
networks were constructed with experimentally obtained constants and analyzed 
by computational methods to understand their role in complex biological pro- 
cesses. These networks exhibit emergent properties such as integration across 
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multiple time scales, self-sustained feedback loops and bi-stable behaviour with 
well-defined input thresholds for transition between states and prolonged signal 
output. The authors point out that these properties of signalling networks raise 
the possibility that information from learned behaviour may be stored within 
intracellular biochemical reactions that comprise signalling pathways, as was al- 
ready suggested by Agnati and Fuxe in 1982 [17], [21], [41] and further stressed 
in the present paper. Thus, from a neurobiological standpoint, the relevance of 
a deep analysis of the MNs cannot be overestimated. From a neuropathological 
standpoint it is our opinion that pathogenetic mechanisms of several neuropsy- 
chiatric disorders reside in the MNs (see also, [23]). Of particular interest could 
be the assessment of the hubs present in a MN. For example, it may be surmised 
that dopamine receptors in the mesolimbic and striatal neuronal systems, are 
hubs and deeply affecting some downstream hub (e.g., DARPP-32), since alter- 
ations in dopamine receptor function in these areas are associated, e.g., with 
schizophrenia and abnormal involuntary movements, respectively. In agreement 
with this view and as discussed in previous papers [33] a special target for phar- 
macological intervention will be found in the HMNs and, in particular, in the 
RRIs. 

Another important facet of the molecular network analysis is its merging with 
the Artificial Intelligence world and with the efforts that currently are done in 
the development of computing devices based on bio-molecules especially on DNA 
and proteins. Let us discuss briefly this issue. Von Newmann, in his pioneering 
book “The Computer and the Brain” , stated that both logics and mathematics 
are historical, accidental forms of expression and the brain could use a communi- 
cation code that has no meaning for us [62]. As pointed out in the Introduction, 
a holistic approach is needed taking into account the “animating principles” that 
make from many different nested hierarchic levels a single complex system. Ac- 
cording to Bacon’s view a basic step to this aim is the investigation of the logic 
behind the phenomenon under study. In our opinion, a logic is needed capable 
of describing organization and operations of the elements within each hierarchic 
level of the network and to combine the different levels. The development of a 
“fractal logic” capable of dealing with systems of different miniaturization lev- 
els, organized in a nested hierarchic fashion and operating according to the same 
rules at each level has been suggested. 

This CNS organization puts also under a different perspective the so-called 
“binding phenomenon”. Let’s discuss briefly this aspect. In the CNS a large num- 
ber of parallel pathways exist that process different but related information that 
is eventually brought together in a single percept. This phenomenon has been 
called the “binding phenomenon” . As matter of fact, the binding phenomenon 
is much wider than before thought if we consider the multiple miniaturization 
levels and the multiple computing networks present at each level. In our opinion 
there are binding phenomena within each level ( “horizontal binding” ) and across 
the multiple hierarchic levels (“vertical binding”). It has been pointed out that 
this new type of binding (i.e., the “vertical binding”) could be discussed in the 
frame of the functional role of local circuits. Local circuits may be the meso-scale 
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level that is particularly involved in allowing the vertical binding especially be- 
tween two crucial hierarchic levels: the molecular network level and the neural 
network level. 

The last point that we would like to discuss is the possible impact that 
studies on the molecular networks, local circuits and neural networks organized 
according to a nested hierarchic principle can have on computer technology. In 
the present discussion we will briefly mention possible developments that these 
studies could allow in the building up of a new generation of computers. A great 
effort is currently made to use molecules for the miniaturization of electronic 
devices. Most of these efforts to miniaturize computers assume that these devices 
will continue to operate using electrons to carry information and transistors to 
process it. Another approach is based on the use of DNA and the appropriate 
enzymes that can manipulate the information stored in the DNA molecule in 
a highly parallel manner. We suggest a different approach based on concepts 
derived from the organization principles and cellular and molecular substrates 
used by the CNS for the information handling. 

Our proposal rests on the merging between Nanotechnology and Bionics that 
should lead to a new discipline: Nanobionics [21]. Let us explain the aim of 
Nanobionics moving from Nanotechnology, which has been defined as the build- 
ing of machines at nanoscale level. On the other hand, Bionics has been defined 
as the attempt to develop better machines through the understanding of bio- 
logical design principles used by living beings to perform similar tasks that the 
human-made machine should perform. Hence, Nanobionics indicates the em- 
ployment of biological design principles (at nanoscale level) to develop a new 
generation of nanoscale machines (Fig. 12). Our proposal opens up a new field: 
the development of machines capable of processing information through the un- 
derstanding of the biological principles used by the horizontal, vertical and kernel 
molecular networks present in neural cells [8], [21]. The proposed application of 
the Nanobionics should allow the planning and construction of a new genera- 
tion of molecular computing devices, which promise to be largely more powerful 
and miniaturized than the silicon based computing devices. In the field of Nan- 
otechnology it has been recognized that one of the main difficulties resides in 
the communication between the nano- world and the macro- world. This difficulty 
represents a central issue in the development of Nanotechnology. Some sugges- 
tions to overcome this difficulty could come from Nanobionics. In the present 
paper the meso-scale role of local circuits has been underlined. Local circuits 
seem to form a structure capable of supporting the micro- world (i.e., the molec- 
ular networks) and at the same time capable of being part of the elaborating 
elements of the macro-world (i.e., the neural networks). These capabilities ob- 
viously represent a great advantage from a space-, time- and an energy-saving 
point of view. Thus, the structural and operational arrangements that allow lo- 
cal circuits to carry out their meso-scale role could indicate a pathway to follow 
for the development of nano-computing devices. 

Another difficulty is the practical building of machine at molecular level, i.e., 
to work with single molecules and to arrange them in a planned fashion. Also 
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Fig. 12. Schematic representation of the Nanobionics approach based on the evidence 
obtained in the study of the horizontal molecular networks. For further details, see 
text. 



in this case Nanobionics can give important hints and offer efficient tools. In 
the present paper the potential importance of the “Lego property”, i.e. , of the 
self-assembling capability of proteins, has been underlined [17]. This property 
strongly indicates the possibility to use a bottom-up approach for manufactur- 
ing these new types of nano-computing devices. Thus, it seems possible to put 
in close vicinity proteins (i.e., the molecules that should form the modules of 
the computing elements). Then, if we have selected proteins that could prop- 
erly interact thanks to their Lego properties, these molecules will spontaneously 
assembly in the supra-molecular complex that can carry out the requested logi- 
cal operations. It should also be noticed that the supporting structure of these 
molecular networks should not be an inert uniform substrate, but instead an 
artificial bio- membrane with appropriate micro-domains (e.g., specialized lipid 
rafts) capable of favoring the insertion of the supra-molecular complexes in an 
ordered and meaningful way. 
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Abstract. I review two studies that both deal with radical changes in 
neuronal circuitry, presumed to have occurred at the transition from 
early reptilians to mammals: the lamination of sensory cortex and the 
differentiation into sub-fields of the mammalian hippocampus. In neither 
case the qualitative structural change seems to be accompanied by an 
equally dramatic functional change in the operation of those circuits. 
Both studies discuss the evolution of cortical networks in terms of their 
computations, quantified by simulating simplified formal models. The 
models can be conceived as variants of a basic autoassociative neural 
network model, whose storage capacity plays an important role in the 
results. Both studies dwell on the interrelationship between qualitative 
and quantitative change, and both studies include, as a necessary in- 
gredient of the relevant computational mechanism, a simple feature of 
pyramidal cell biophysics: firing rate adaptation. 



1 A Structural Phase Transition 

Mammals originate from the therapsids, one order among the first amniotes, or 
early reptiles, as they are commonly referred to. They are estimated to have 
radiated away from other early reptilian lineages, including the anapsids (the 
progenitors of modern turtles) and diapsids (out of which other modern reptil- 
ians, as well as birds, derive) some three hundred million years ago [14]. Perhaps 
mammals emerged as a fully differentiated class out of the tlrird-to-last of the 
great extinctions, in the Triassic period. The changes in the organization of the 
nervous system, that mark the transition from proto-reptilian ancestors to early 
mammals, can be reconstructed only indirectly. Along with supporting argu- 
ments from the examination of endocasts (the inside of fossil skulls; [30]) and of 
presumed behavioural patterns [58], the main line of evidence is the compara- 
tive anatomy of present day species [19]. Among a variety of quantitative changes 
in the relative development of different structures, changes that have been ex- 
tended, accelerated and diversified during the entire course of mammalian evo- 
lution [23], two major qualitative changes stand out in the forebrain, two new 
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features that, once established, characterize the cortex of mammals as distinct 
from that of reptilians and birds. Both these changes involve the introduction of 
a new “input” layer of granule cells. 

In the first case, it is the medial pallium (the medial part of the upper surface 
of each cerebral hemisphere, as it bulges out of the forebrain) that reorganizes 
into the modern-day mammalian hippocampus. The crucial step is the detach- 
ment of the most medial portion, that loses both its continuity with the rest 
of the cortex at the hippocampal sulcus, and its projections to dorso-lateral 
cortex [55]. The rest of the medial cortex becomes Ammon’s horn, and retains 
the distinctly cortical pyramidal cells, while the detached cortex becomes the 
dentate gyrus, with its population of granule cells, that project now, as a sort 
of pre-processing stage, to the pyramidal cells of field CA3 [3]. In the second 
case, it is the dorsal pallium (the central part of the upper surface) that reorga- 
nizes internally, to become the cerebral neocortex. Aside from special cases, most 
mammalian neocortices display the characteristic isocortical pattern of lamina- 
tion, or organization into distinct layers of cells (traditionally classsified as 6, in 
some cases with sublayers) [41,46, 18]. The crucial step, here, appears to be the 
emergence, particularly evident in primary sensory cortices, of a layer of non- 
pyramidal cells (called spiny stellate cells, or granules) inserted in between the 
pyramidal cells of the infragranular and supragranular layers. This is layer IV, 
where the main ascending inputs to cortex terminate [20] . 



1.1 An Information-Theoretical Advantage in the Hippocampus 

What is the evolutionary advantage, for mammals, brought about by these 
changes? In the case of the hippocampus, attempts to account for its remarkable 
internal organization have been based, since the seminal paper by David Marr 
[42], on the computational analysis of the role of the hippocampus in memory. 
The hippocampus is important for spatial memory also in birds. A reasonable 
hypothesis is that the “invention” of the dentate gyrus enhances its capability, in 
mammals, to serve as a memory store. Following the approach outlined by David 
Marr, it was proposed 12 years ago [49] that the new input to CA3 pyramidal 
cells from the mossy fibers (the axons of the dentate granule cells) serves to 
create memory representations in CA3 richer in information content than they 
could have been otherwise. The crucial prediction of this proposal was that the 
inactivation of the mossy fiber synapses should impair the formation of new hip- 
pocampal dependent memories, but not the retrieval of previously stored ones. 
This prediction has recently been supported [35] at the behavioural level in mice, 
while neurophysiological experiments are in progress with rats. If validated, this 
hypothesis suggests that indeed a quantitative, information-theoretical advan- 
tage may have favored a qualitative change, such as the insertion of the dentate 
gyrus in the hippocampal circuitry. 

This possibility raises the issue of whether also the insertion of layer IV in the 
isocortex might be accounted for in quantitative, information-theoretical terms, 
an issue discussed in the next section. 
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Fig. 1 . The structural phase transition in the hippocampus. The medial pallium of a 
reptile (a lizard), top, with indicated the Large Cell (LC) and Small Cell (SC) subdi- 
visions. Examples of the reorganized medial pallium in 4 highly divergent mammalian 
species, bottom: A - platypus; B - rat; C - cat; D human. The homolog of the SC 
subdivision has become the detached dentate gyrus, which sends connections to the 
CA3 portion of the homolog of the LC subdivision, that has remained continuous with 
the rest of the cortex. 



At the same time, the DG argument does not itself address the CA3-CA1 
differentiation, which is equally prominent in the mammalian hippocampus. Sec- 
tion 3 will review a computational approach to the hippocampus problem, and 
mention fresh experimental results that are shading an entirely new light on it. 
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1.2 An Information-Theoretical Hypothesis about Layers and Maps 

It has long been hypothesized that isocortical lamination appeared together with 
fine topography in cortical sensory maps [2] , pointing at a close relationship be- 
tween the two phenomena. All of the cortex, which develops from the upper half 
of the neural tube of the embryo, has been proposed to have been, originally, sen- 
sory, with the motor cortex differentiating from the somatosensory portion [38, 
21]. In early mammals, the main part of the cortex was devoted to the olfactory 
system, which is not topographic, and whose piriform cortex has never acquired 
isocortical lamination [25]. The rest of the cortex was largely allocated to the 
somatosensory, visual and auditory system, perhaps with just one topographic 
area, or map, each [19]. Each sensory map thus received its inputs directly from 
a corresponding portion of the thalamus, as opposed to the network of cortico- 
cortical connections which has been greatly expanded [1, 13] by the evolution of 
multiple, hierarchically organized cortical areas in each sensory system [32,34]. 
In the thalamus, a distinction has been drawn [31] between its matrix and core 
nuclei. The matrix, the originally prevalent system, projects diffusely to the up- 
per cortical layers; while the core nuclei, which specialize and become dominant 
in more advanced species [22], project with topographic precision to layer IV, 
although their axons contact, there, also the dendrites of pyramidal cells whose 
somata lie in the upper and deep layers. 

The crucial aspect of fine topography in sensory cortices is the precise cor- 
respondence between the location of a cortical neuron and the location, on the 
array of sensory receptors, where a stimulus can best activate that neuron. Sim- 
ple visual and somatosensory cortices thus comprise 2D maps of the retina and 
of the body surface, while auditory cortices map sound frequency in 1 dimen- 
sion, and what is mapped in the other dimension is not quite clear [45]. Some 
of the parameters characterizing a stimulus, those reflected in the position of 
the receptors it activates, are therefore represented continuously on the cortical 
sheet. We define them as providing positional information. Other parameters, 
which contribute to identify the stimulus, are not explicitly mapped on the cor- 
tex. For example, the exact nature of a tactile stimulus at a fixed spot on the 
skin, whether it is punctuate or transient or vibrating, and to what extent, are 
reflected in the exact pattern of activated receptors, and of activated neurons 
in the cortex, but not directly in the position on the cortical sheet. We define 
these parameters as providing identity information. Advanced cortices, like the 
primary visual cortex of primates, include complications due to the attempt to 
map additional parameters on the sheet, like ocular dominance or orientation, 
in addition to position on the retina. This leads to the formation of so-called 
columns, or wrapped dimensions, and to the differentiation of layer IV in mul- 
tiple sublayers. They should be regarded as specializations, which likely came 
much after the basic cortical lamination scheme had been laid out. 

The sensory cortices of early mammals therefore received from the thalamus, 
and had to analyse, information about sensory stimuli of two basic kinds: po- 
sitional or where information, I pi and identity or what information, J^. These 
two kinds differ also in the extent to which cortex can contribute to the analysis 
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of the stimulus. Positional information is already represented explicitly on the 
receptor array, and then in the thalamus, and each relay stage can only degrade 
it. At best, the cortex can try to maintain the spatial resolution with which the 
position of a stimulus is specified by the activation of thalamic neurons: if these 
code it inaccurately, there is no way the cortex can reconstruct it any better, 
because any other position would be just as plausible. The identity of a stimulus, 
however, may be coded inaccurately by the thalamus, with considerable noise, 
occlusion and variability, and the cortex can reconstruct it from such partial 
information. This is made possible by the storage of previous sensory events 
in terms of distributed efficacy modifications in synaptic systems, in particular 
on the recurrent collaterals connecting pyramidal cells in sensory cortex. Neural 
network models of autoassociative memories [42, 29] have demonstrated how sim- 
ple “Hebbian” rules modelling associative synaptic plasticity can induce weight 
changes that lead to the formation of dynamical attractors [5] . Once an attractor 
has been formed, a partial cue corresponding e.g. to a noisy or occluded version 
of a stimulus can take the recurrent network within its basin of attraction, and 
hence lead to a pattern of activation of cortical neurons, which represents the 
stored identity of the original stimulus. Thus by exploiting dislromogeneities in 
the input statistics - some patterns of activity, those that have been stored, are 
more “plausible” than others - the cortex can reconstruct the identity of stimuli, 
over and beyond the partial information provided by the thalamus. This analysis 
of current sensory experience in the light of previous experience is hypothesized 
here to be the generic function of the cortex, which thus blends perception with 
memory [56]. Specialized to the olfactory sense, this function does not seem to 
require new cortical machinery to be carried out efficiently. A novel circuitry 
may instead be advantageous, when the generic function is specialized to topo- 
graphic sensory systems, which have to relay both where and what information, 
I p and Jj. We take up the validation of this possibility in the next section. 

In the past two decades, physicists have studied various models of autoasso- 
ciative memory using different model neurons and different “learning rules” to 
implement Hebbian learning [29,6]. Mathematical methods have been adapted 
from statistical and spin glass physics for the purpose of analyzing these neu- 
ronal networks [4] . Although most of these investigation have been made on very 
abstract and simplified models, they have provided us with a good understand- 
ing of the general properties of associative memories, e.g. storage capacity and 
retrieval dynamics. I have exploited this accumulated understanding by using, in 
the simulations reviewed in the next two sections, variants of an autoassociative 
network model comprised of threshold-linear units and including a geometrical 
organization of neuronal connectivity. Such a model lends itself, besides simu- 
lations, to analytical studies, which I have carried out in the past on simpler 
variants [48, 50] , have been extended into a sophisticated analysis of hippocam- 
pal chart storage by Francesco Battaglia [9], and are being further developed at 
present by Yasser Roudi and others within my LIMBO group. 
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2 Validation of the Lamination Hypothesis 

Does preserving accurate coding of position, in an isocortical patch, conflict 
with the analysis of stimulus identity? This is obviously a quantitative question, 
which has to be addressed with a suitable neural network model. An appropriate 
model can be designed with similar features as the one considered above, but 
with the additional option of differentiating multiple layers. In particular, the 
model of a cortical patch, receiving inputs from a thalamic array of units, can be 
investigated in its ability to generate localized retrieval states, that correspond 
to the stored patterns modulated by bumps, studied analytically in the section 
above. Unlike the analytical study, which is easier to conduct in a well defined 
limit case, e.g. looking at the existence of asymptotic attractor states after an 
afferent cue that has initialized activity in the network has been removed, with 
simulations one can also study the dynamics of localization and retrieval in time, 
with a cue that follows its own time course. Contrasting a model network with 
differentiated layers with one that has the same number of units and connections, 
but statistically equivalent layers, allows to approach the question of the role of 
lamination. The presence of several layers would anyway make an analytical 
treatment, while not impossible, very cumbersome, and computer simulations 
appear to be the method of choice. This is the approach taken in Ref. [53], the 
results of which are briefly summarized here. 

A patch of cortex was modelled as a wafer of 3 arrays, each with Nx N units. 
Each unit receives Cff feedforward connections from a further array of N x N 
“thalamic” units, and C rc recurrent connections from other units in the patch. 
Both sets of connections are assigned to each receiving unit at random, with a 
Gaussian probability in register with the unit itself, and of width Sff and S rc , 
respectively 1 . To model, initially, a uniform, non-laminated patch, the 3 arrays 
are identical in properties and connectivity, so the C rc recurrent connections each 
unit receives are drawn at random from all arrays. To model a laminated patch, 
later, different properties and connectivities are introduced among the arrays, 
but keeping the same number of units and connections, to provide for a correct 
comparison of performance. The 3 arrays will then model supragranular, granular 
and infragranular layers of the isocortex [53]. A local pattern of activation is 
applied to the thalamic units, fed forward to the cortical patch and circulated 
for Nuer time steps along the recurrent connections, and then the activity of 
some of the units in the patch is read out. To separate out “what” and “where” 
information, the input activation is generated as the product of one of a set of M 
predetermined global patterns, covering the entire N x N input array, by a local 
focus of activation, defined as a Gaussian tuning function of width R, centered at 
any one of the N 2 units. The network operates in successive training and testing 
phases. In a training phase, each of the possible MxNxN activations is applied, 
in random sequence, to the input array; activity is circulated in the output 
arrays, and the resulting activation values are used to modify connections weights 

1 Periodic boundary conditions are used, to limit finit size effects, so the patch is in 

fact a torus. 
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according to a model associative rule. In a testing phase, input activations are 
the product of a focus, as for training, by a partial cue, obtained by setting a 
fraction of the thalamic units at their activation in a pattern, and the rest at 
a random value, drawn from the same general distribution used to generate the 
patterns. The activity of a population of output units is then fed into a decoding 
algorithm - external to the cortical network - that attempts to predict the actual 
focus (its center, p) and, independently, the pattern i used to derive the partial 
cue. Ii is extracted from the frequency table P(i,id) reporting how many times 
the cue belonged to pattern i = 1 , ,M but was decoded as pattern id : 

/< = £?(>,<,<) iog 2J ^£j (i) 

and a similar formula is used for I p . The learning rule used to modify connection 
weights was 

Awij oc r post ■ (r pre — < r pre >) (2) 

applied, at each presentation of each training phase, to weight Wy . Weights 
are originally set at a constant value (normalized so that the total strength 
of afferents equals that of recurrent collaterals), to which is added a random 
component of similar but asymmetrical mean square amplitude, to generate 
an approximately exponential distribution of initial weights onto each unit, r 
denotes the firing rates of the pre- and postsynaptic units, and < ... > an 
average over the corresponding array. 

Among the several parameters that determine the performance of the net- 
work, R « S rc was fixed, while Sff was varied from Sff ~ R up to Sff ~ S rc . 
It is intuitive that if the feedforward connections are focused, Sff ~ R, “where” 
information can be substantially preserved, but the cortical patch is activated 
over a limited, almost point-like extent, and it may fail to use efficiently its recur- 
rent collaterals to retrieve “what” information. If the other hand Sff — S rc , the 
recurrent collaterals can better use their attractor dynamics, leading to higher 
Ii values, but the spread of activity from thalamus to cortex means degrading 
I p . This conflict between I p and l t is depicted in Fig. 2, which reports their joint 
values extracted from simulations, as they vary as a function of the spread of 
the afferents, at the end of the training phase (full curve). What is decoded is 
the activity of all units in the upper array of the patch. Since the patch is not 
differentiated, however, the other two arrays provide statistically identical infor- 
mation. Further, since information of both the what and where kinds is extracted 
from a number of units already well in the saturation regime [52] , even decoding 
all units in all 3 arrays at the same time, or only, say, half of the units in any 
single array, does not alter the numbers significantly. l t is monotonically increas- 
ing with Sff. I p , instead, decreases with Sff, and as a result one can vary Sff 
to select a compromise between what and where information, but not optimise 
both simultaneously. This conflict between what and where persists whatever the 
choice of all the other parameters of the network, although of course the exact 
position of the I p — /, limiting boundary varies accordingly. Is it possible to go 
beyond such boundary? 
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2.1 Differentiation among Isocortical Layers 

Several modifications of the “null hypothesis” uniform model were explored, as 
reported in [53]. Figure 2 illustrates, along with the results of the uniform model, 
results pertaining to slightly different versions of a 3-layer laminated model. Ba- 
sically, the granular layer is differentiated by (i) focusing the thalamic afferents 
to the granular layer, while those to the two pyramidal layers are diffuse; (ii) 
restricting the recurrent collateral system of the granular units, by focusing the 
connections departing from granular units and decreasing the number of connec- 
tions arriving at layer IV from the pyramidal layers; finally (iii) layer IV units 
follow a non-adaptive dynamics, and they do not operate during training, but 
only during testing. The non-adapting dynamics is effected, in the simulations by 
making their effect on postsynaptic units, whatever their layer, scale up linearly 
with iteration cycle. Thus, compared to the model pyramidal units, whose firing 
rate would adapt over the first few interspike intervals, in reality (but is kept 
in constant ratio to the input activation, in the simulations), the firing rate of 
granule units, to model lack of adaptation, is taken to actually increase in time 
for a given input activation. 

Differentiating infra- from supra-granular connections is effected by simply 
replacing the connections from layer IV to the infragranular pyramidal units 
with connections to the same units from supragranular units. In the real cortex, 
the supragranular layers project mainly onward, to the next stage of processing. 
The infragranular layers project mainly backward [7], or subcortically. Among 
their chief target structures are the very thalamic nuclei from which projections 
arise to layer IV. It is clear that having different preferential targets would in 
principle favour different mixes of what and where information. In particular, 
cortical units that project back to the thalamus would not need to repeat to 
the thalamus “where” a stimulus is, since this information is already coded, and 
more accurately, in the activity of thalamic units. They would rather report in its 
full glory the genuine contribution of cortical processing, that is, the retrieval of 
identity information. Units that project to further stages of cortical processing, 
on the other hand, should balance the “what” added value with the preservation 
of positional information. With this combination of modifications, layer III be- 
comes the main source of recurrent collaterals [43, 59], which are spread out and 
synapse onto both supra- and infra-granular units and also, to a lesser degree, 
layer IV units. 

The effect of the overall model of the differentiation can be appreciated by 
decoding the activity in the three layers, separately, as shown in Fig. 2 by the 
isolated symbols. From layer IV one can extract a large I p but limited Ii\ from 
layer III one obtains a balanced mix. From layer V, on the other hand, one can 
extract predominantly “what” information, Jj, at the price of a rather reduced 
I p content. Thus, the last connectivity change, by effectively reducing the cou- 
pling between granular and infragranular layers, has made the latter optimize 
“what” information, while neglecting “where” information, of limited interest 
to their target structures. Can we understand the advantage brought about by 
lamination? The modifications required in the connectivity of layer IV are in- 
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Fully differentiated model 




Fig. 2 . 1, i and I p values obtained, after 3 training epochs, with the uniform model and, 
for 4 different parameter choices, for the fully differentiated model. 3 of the data points 
for the infragranular layer (black triangles) are nearly superimposed. 



tuitive: they make granule units more focused in their activation, in register 
with the thalamic focus, while allowing the pyramidal units, that receive diffuse 
feedforward connections, to make full use of the recurrent collaterals. What is 
less intuitive is the requirement for non-adapting dynamics in the granule layer. 
It turns out that without this modification in the dynamics, the laminated net- 
work essentially averages linearly between the performances of uniform networks 
with focused and with diffuse connectivity, without improving at all on a case 
with, say, intermediate spread parameters for the connections. This is because 
the focusing of the activation and the retrieval of the correct identity interfere 
with each other, if carried out simultaneously, even if the main responsability for 
each task is assigned to a different layer. Modifying the dynamics of the model 
granules, instead, enables the recurrent collaterals of the pyramidal layers to first 
better identify the attractor, i.e. the stored global pattern, to which the partial 
cue “belongs”, and to start the dynamical convergence towards the bottom of 
the corresponding basin of attraction [4]. Only later on, once this process is - 
in most cases - safely underway, the granules make their focusing effect felt by 
the pyramidal units. The focusing action, by being effectively delayed after the 
critical choice of the attractor, interferes with it less - hence, the non-linear 
advantage of the laminated model. 
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3 What Do We Need DG and CA1 for? 

If the synapses on the recurrent collaterals among pyramidal cells of the primi- 
tive cortex were endowed, as it is likely, with associative, “Hebbian”, plasticity, 
such as that based on NMDA receptors [17], that cortex could have operated 
as an associative memory [13] - provided it had an effective way of distinguish- 
ing its operating modes. A generic problem with associative memories based on 
recurrent collaterals is to distinguish a storage mode from a retrieval mode. To 
be effective, recurrent collaterals should dominate the dynamics of the system 
when it is operating in retrieval mode; whereas while storing new information 
the dynamics should be primarily determined by afferent inputs, with limited 
interference from the memories already stored in the recurrent collaterals. The 
recurrent collaterals, instead, should modify their weights to store the new infor- 
mation [49]. In the model usually studied analytically, the learning phase is not 
explicitly considered. In the simulations of the laminated model, in the previous 
section, the distinction is partially inserted by hand, by forcing the layer IV 
units to be silent during training. In the simulations [54] and in the experiments 
mentioned in this section, instead, the learning phase is studied explicitly. 



3.1 Distinguishing Storage from Retrieval 

The most plrylogenetically primitive solution to distinguish storage from retrieval 
is to use a modulator which acts differentially on the afferent inputs (originally, 
those arriving at the apical dendrites) and on the recurrent connections (predom- 
inantly lower on the dendritic tree). Acetylcholine (ACh) can achieve this effect, 
exploiting the orderly arrangement of pyramidal cells dendrites [26]. Acetyl- 
choline is one of several very ancient neuromodulating systems, well conserved 
across vertebrates, and it is likely that it operated in this way already in the 
early reptilian cortex, throughout its subdivisions. In recent years, Mike Has- 
selmo has been emphasizing this role of ACh in memory, with a combination 
of slice work and neural network modeling [27, 28]. This work has been focused 
on the hippocampus - originally, the medial wall - and on piriform cortex - 
originally, the lateral wall. In the hippocampus, however, it appears that mam- 
mals have devised a more refined trick to separate storage from retrieval, and 
perform both efficiently: operating the dentate gyrus preprocessor. It is illumi- 
nating, in fact, to contrast the avian and mammalian hippocampi. They are 
structurally very different, with birds having stayed close to their reptilian pro- 
genitors, and mammals having detached the dentate gyrus from Ammon’s Horn, 
as mentioned above. Yet, at the behavioural level, the hippocampus of birds has 
been implicated in spatial memory in a role qualitatively similar to the prevail- 
ing description for the rodent hippocampus. Evidence comes from pigeons [10] 
and other species, and there is extensive literature to document it [15, 16]. 

Initially, the neural network approach, aiming at explaining structure from 
function, seemed to apply indiscriminately to hippocampal function in both birds 
and mammals, and therefore to be unable to say anything about the structural 
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differences between the two. In his early paper, David Marr guessed the impor- 
tance of recurrent collaterals, a prominent feature of the CA3 subfield [3], even 
though his own model was not really affected by the presence of such collaterals, 
as shown later [57]. Although the paper by Marr was nearly simultaneous with 
two of the most exciting experimental discoveries related to the hippocampus, 
that of place cells [44] and that of long term synaptic potentiation [11] for a long 
time it did not seem to inspire further theoretical analyses - with the exception 
of an interesting discussion of the collateral effect in a neural network model [24] . 
Marr himself become disillusioned with his youthful enthusiasm for unraveling 
brain circuits, and in his mature years took a much more sedate - and less neural 
- interest in vision. From 1987, however, McNaughton and Morris (1987) and 
then an increasing number of other investigators rediscovered the young Marr, 
and tried to elaborate those ideas in order to pin down the contribution of spe- 
cific elements of the hippocampal circuitry. Edmund Rolls (1989) and several 
others have emphasized the crucial role probably played by the CA3 recurrent 
collaterals, that may form an autoassociator, a well studied network model of a 
content addressable memory. An autoassociator may subserve both the storage 
of episodic memories, e.g. in humans, and the storage of memory for space, e.g. 
in rats [8]). 

The emphasis on the essential role of the CA3 recurrent collaterals opened 
the way for attempting to understand the specialization of the dentate gyrus, in 
mammals [49] . A quantitative analysis of different network architectures (essen- 
tially, an autoassociator, CA3, operating with and without dentate gyrus to aid 
it in storing new memories) indicated an information theoretic advantage of one 
over the other in forming new representations. The models used were very ab- 
stract, and thus amenable to theoretical analysis [51] instead of just simulation, 
yet broadly consistent with generic cortical circuitry at all levels of details below 
the one being investigated. Conceptually, the function ascribed to the dentate 
is equivalent to the function ascribed to acetylcholine - to enhance the rela- 
tion between hippocampal activity and afferent inputs during memory storage. 
The quantitative argument, however, allows a functional prediction at the neural 
level, which can be tested with suitable experiments. The prediction is that if 
mossy fibers are inactivated, the system is not able to acquire new hippocampal 
memories; or, more precisely, new memories rich in their information content. 
It should be able, nevertheless, to retrieve the memories already in store (and 
perhaps to form very impoverished representations of new memories) . Somewhat 
surprisingly, the prediction has already been borne out of a purely behavioral 
experiment, in which mice were tested in a Morris water maze while transmission 
of dentate granule cell action potentials was reversibly blocked [35]. Physiolog- 
ical experiments using similar techniques, in conjunction with measures of the 
information content of neural patterns of activity, will allow for more stringent 
tests of the argument. The mammalian ‘invention’ of the dentate gyrus, an in- 
genuity which likely took a long time to evolve from the simpler early reptilian 
organization, may thus represent a quantitative not qualitative improvement: 
qualitatively, we had acetylcholine already; but we managed to further improve 
on that. 
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Fig. 3. Scheme of some of the main subfields and synaptic systems of the hippocampus 
proper. Redrawn from [47]. 

3.2 CA1 in Search of a Role 

If DG can be understood as a CA3 preprocessor, perhaps CA1 should be un- 
derstood as a CA3 postprocessor. Yet studies based solely on the notion of the 
usefulness of a further associative memory and recoding stage after CA3 (Treves, 
1995) failed to illustrate impressive advantages to adding such a stage. More in- 
teresting hints come from neuropsychological studies in rats [33] , that indicate a 
more salient role for CA1 along the temporal dimension. CA3 may specialize in 
associating information that was experienced strictly at the same time, whereas 
CA1 may link together, more than CA3, information across adjacent times. A 
way to formulate a qualitative implication of such a functional differentiation 
is to state that CA1 is important for prediction , i.e. for producing an output 
represention of what happened just after, at the time of storage, whatever is 
represented by the pattern of activity retrieved at the CA3 stage. Note, how- 
ever, that reading the Kesner review in full indicates that the table at the end is 
a well-meaning simplification. Their Fig. 31. 2 suggests that CA3 may be involved 
in temporal pattern separation just as much as CA1. Moreover, the role of either 
DG or CA3 in temporal pattern association has never really been assessed. Fur- 
ther, available studies on the role of CA1 fail to make a clear distinction between 
tasks in which massive hippocampal outputs to the cortex are crucial, and tasks 
in which a more limited hippocampal influence on the cortex may be sufficient. 
In the first case, lesioning CA1 should have an effect independently of what CA1 
specifically contributes to information processing, simply because one is sever- 
ing the main hippocampo-cortical output pathway. In the second, CA3 outputs 
through the fimbria/fornix could enable hippocampal-mediated influences to be 
felt, deprived, though, of the specific CA1 contribution. 

Structurally, CA3 and CA1 are contiguous portions of the dorsomedial cor- 
tex. When this reorganizes into the mammalian hippocampus, CA3 and CA1 
differentiate in two important ways. First, only CA3 receives the projections 
from the dentate gyrus, the mossy fibers. Second, only CA3 is dominated by 
recurrent collaterals, while most of the inputs to CA1 cells are the projections 
from CA3, the Schaffer collaterals (Amaral et al, 1990). In [54] the hypothe- 
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sis was explored that the differentiation between CA3 and CA1 may help solve 
precisely the computational conflict between pattern completion, or integrating 
current sensory information on the basis of memory, and prediction, or moving 
from one pattern to the next in a stored sequence. Neural network simulations, 
based on the same sort of model as those studied analytically [48, 50] and as that 
applied to laminar cortex in section 2, were used to assess to what extent CA3 
would take care of retrieval, while CA1 could concentrate on prediction. With 
the simulations, at the price of some necessary simplification, I could compare 
the performance of the differentiated circuit with a non-differentiated circuit 
of equal number and type of components (one in which CA3 and CA1 have 
identical properties, e.g. both receive mossy fibers and are interconnected with 
recurrent collaterals). Lesion studies, instead, can only compare the normal cir- 
cuit with others with missing components, and it is thus difficult for them to 
say the last word on the meaning of a differentiation. The hypothesis was not 
really supported by my neural network simulations. The conflict indeed exists, 
but the crucial parameter that regulates it appears to be simply the degree of 
firing frequency adaptation in pyramidal cells. The differentiation between the 
architectures of CA3 and CA1 has a minor effect on temporal prediction, while 
it does significantly increase the information content of hippocampal outputs 
(see [54] for a full account). 

After those simulations were completed, new experimental results from the 
labs of Edvard Moser [39] and James Knierim [36] have shed a completely new 
light on the significance of the CA3-CA1 differentiation. As explained in forth- 
coming papers [40,37], activity in CA3 and CA1 differs remarkably when rats 
are asked to navigate in environments that some cues suggest are the same, and 
others indicate they are different. CA3 appears to take an all-or-none decision, 
usually allocating nearly orthogonal neural representations to even very similar 
environments, and switching to essentially identical representations only above 
a high threshold of physical similarity. Activity in CA1, instead, varies smoothly 
to reflect the degree of similarity. This functional differentiation, and the finding 
that new representations in CA3 emerge slowly, presumably through iterative 
processing, are entirely consistent with the recurrent character of the CA3 net- 
work, and the prevailing feedforward character of the CA1 network. Thanks to 
these experimental findings, therefore, we are beginning to finally ‘understand’ 
CA1, and to make complete sense of the events that drastically altered the struc- 
ture of our medial pallium nearly 200 million years ago. 
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Abstract. We review the use of mean field theory for describing the 
dynamics of dense, randomly connected cortical circuits. For a simple 
network of excitatory and inhibitory leaky integrate-and-fire neurons, 
we can show how the firing irregularity, as measured by the Fano factor, 
increases with the strength of the synapses in the network and with the 
value to which the membrane potential is reset after a spike. Generalizing 
the model to include conductance-based synapses gives insight into the 
connection between the firing statistics and the high-conductance state 
observed experimentally in visual cortex. Finally, an extension of the 
model to describe an orientation hypercolumn provides understanding 
of how cortical interactions sharpen orientation tuning, in a way that is 
consistent with observed firing statistics. 



1 Introduction 

Neocortical circuits are highly connected: a typical neuron receives synaptic in- 
put from of the order of 10000 other neurons. This fact immediately suggests 
that mean held theory should be useful in describing cortical network dynamics. 
Furthermore, a good fraction, perhaps half, of the synaptic connections are local, 
from neurons not more than half a millimeter away, and on this length scale (i.e., 
within a “cortical column” ) the connectivity appears to be highly random, with 
a connection probability of the order of 10%. This requires a level of mean held 
theory a step beyond the kind used for uniform systems in condensed matter 
physics like ferromagnets. It has to describe correctly the fluctuations in the in- 
puts to a given network element as well as their mean values, as in spin glasses. 
The theory we use here is, in fact, adapted directly from that for spin glasses. 

A generic feature of mean held theory for spin glasses and other random sys- 
tems is that the “quenched disorder” in the connections (the connection strengths 
in the network do not vary in time) leads to an effectively noisy input to a single 
unit that one studies: Spatial disorder is converted to temporal. The presence of 
this noise offers a fundamental explanation for the strong irregularity of bring 
observed experimentally in cortical neurons. For high connectivity, the noise is 
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Gaussian, and the correct solution of the problem requires its correlation func- 
tion to be found self-consistently. In this paper we summarize how to do this for 
some simple models for cortical networks. 

We focus particularly on the neuronal firing statistics. There is a long history 
of experimental investigations of the apparent noisiness of cortical neurons [7 
19], but very little in the way of theoretical work based on network models. Our 
work begins to fill that gap in a natural way, since the full mean field theory of a 
random network is based on self-consistently calculating the correlation function. 
In particular, we are able to identify the features of the neurons and synapses in 
the network that control the bring correlations. 

The basic ideas developed here were introduced in a short paper [1], and 
these models are treated in greater detail in several other papers [2-6] . Here we 
just want to give a quick overview of the mean held approach and what we can 
learn from it. 

2 Single Column Model, Current-Based Synapses 

In all the work described here, our neurons are of the leaky integrate-and-hre 
kind, though it is straightforward to extend the method to other neuronal mod- 
els, based, for example, on Hodgkin-Huxley equations. In our simplest model, 
we consider networks of excitatory and inhibitory neurons, each of which re- 
ceives a synaptic connection from every other neuron with the same probability. 
Each such connection has a “strength” , the amount by which a presynaptic spike 
changes the postsynaptic potential. In this model, these strengths are indepen- 
dent of the postsynaptic membrane potential (“current-based synapses”). All 
excitatory-to-excitatory connections that are present are taken to have the same 
strength, and analogously for the three other classes of connections (excitatory- 
to-inhibitory, etc.). However, the strengths for the different classes are not the 
same. In addition, excitatory and inhibitory neurons both receive excitation from 
an external population, representing “the rest of the brain”. (For primary sen- 
sory cortices, this excitation includes the sensory input from the thalamus.) This 
is probably the simplest generic model for a generic “cortical column” of spiking 
neurons. 

The network is taken to have Ni excitatory and IV 2 inhibitory neurons. A 
given neuron (of either kind) receives synaptic input from every excitatory (resp. 
inhibitory) neuron with probability K\/N\ (resp. K 2 /N 2 ), with K a /N a indepen- 
dent of a. In our calculations we take the connection density K a /N a to be 10%, 
but the results are not very sensitive to its value as long as it is fairly small. 
Each nonzero synapse from a neuron in population b to one in population a is 
taken to have the value J a b / V Kb ■ Synapses from the external population are 
treated in the same way, with strengths J a o/\/Ko- For simplicity, neurons in 
the external population are assumed to fire like stationary independent Poisson 
processes. We consider the limit K a — > 00 , N a — > 00 , with I\ a /N a fixed, where 
mean held theory is exact. 
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The subthreshold dynamics of the membrane potential of neuron i in popu- 
lation a obey 



<K 

d t 



a 2 A T b 

b= 0 j=l 






(1) 



where S b (t) = ^2 S d(t — t h jS ) is the spike train of neuron j in population b. The 
membrane time constant is taken to have the same value r for all neurons. We 
give the firing thresholds a narrow distribution of values (10% of the mean value, 
1). We take the firing thresholds 9 a = 1 and the postfiring reset levels to be 0. 
We ignore transmission delays. 

The essential point of mean field theory is that for such a large, homoge- 
neously random network, as for an infinite-range spin glass [20-22], we can treat 
the net input to a neuron as a Gaussian random process. This reduces the net- 
work problem to a single-neuron one, with the feature that the statistics of the 
input have to be determined self-consistently from the firing statistics of the sin- 
gle neurons. This reduction was proved formally for a network of spiking neurons 
by Fulvi Mari [23]. 

Explicitly, the effective input current to neuron i in population a can be 
written 

I?(t) = J2 MVK~ b r b + B b x? + \J 1 - K b /N b £f(t)}. (2) 

b 



Here r b = N b 1 ]>T r b is the average rate in population 6, 



b * = < 3 > 

x^ b is a unit-variance Gaussian random number, and £“ b (t) is a (zero-mean) 
Gaussian noise with correlation function equal to C b (t — t'), the average corre- 
lation function in population b. For the contribution from a single population, 
labeled by 6, the first term in (2), which represents the mean input current, is 
larger than the other two, which represent fluctuations, by a factor of 0(\j r K b ): 
averaging over many independent input neurons reduces fluctuations relative to 
those in a single neuron by the square root of the number of terms in the sum. 
(For our way of scaling the synapse strengths, the factor y/Kx in the first term 
arises formally from adding I\ b terms, each of which is proportional to 1 /y/I\ b .) 

However, while the fluctuation terms are small in comparison to the mean 
for a given input population 6, small compared to the population-averaged input 
(the first term in (2)), we will see that when we sum over all populations the 
first term will vanish to leading order. What remains of it is only of the same 
order as the fluctuations. Therefore fluctuations can not be neglected. 

The fact that the fluctuation terms are Gaussian variables is just a conse- 
quence of the central limit theorem, since we consider the limit K b — > oo. 

Note that one fluctuation term is static and the other dynamic. The origin of 
the static one is the fact that the network is inhomogeneous, so different neurons 
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will have different number of synapses and therefore different strengths of net 
time-averaged inputs. 

It is perhaps not immediately obvious, but the formal derivation ([23], see 
also [24] for an analogous case) shows that the dynamic noise also originates 
from the random inlromogeneity in the network. It would be absent if there were 
no randomness in the connections, as, for example, in a model like ours but with 
full connectivity. The presence of the factor sJl^KaJNa in the third term in (2) 
makes this point evident; in the general case the noise variance is proportional 
to the variance of the connection strengths. 



The Mean Field Ansatz 



In any mean field theory, whether it is for a ferromagnet, a superconductor, elec- 
troweak interactions, or a neural network, one has to make an ansatz describing 
the state in question. This ansatz contains some parameters (generally called 
“order parameters”), the values of which are then determined self-consistently. 
Here, our “order parameters” are the mean rates r^, their mean square values 
(which appear in (3), and the correlation functions Cb(t — t'). We make an ansatz 
for the correlation functions that describes an asynchronous irregular firing state: 
We take rb to be time-independent and Cb(t — t') to have a delta-function peak 
(of strength equal to Tb) at t = t', plus a continuous part that falls off toward 
zero as \t — t'\ — » oo. We could also look, for example, for solutions in which 
rb was time-dependent and/or C/(f — t') had extra delta-function peaks (these 
might describe oscillating population activity ), but we have not done so. Thus, 
we cannot exclude the existence of such exotic states, but we can at least check 
whether our asynchronous, irregularly-firing states exist and are stable. 

We can find the mean rates, at least when they are low, independently of 
their fluctuations and the correlation functions: In an irregularly-firing state the 
membrane potential should fluctuate around a stationary value, with the noise 
occasionally and irregularly driving it up to threshold. In mean field theory, we 
have 



at r 



( 4 ) 



where I a (t) is given by (2). (We have dropped the neuron index i, since we are 
now doing a one-neuron problem.) From (2), we see that the leading terms in 
I a (t) are large (oc VKb), so if the membrane potential is to be stationary they 
must nearly cancel: 



2 



Y J J ab\fK'bn = 0{ 1 ). 



( 5 ) 



6= 0 



That is, the mean excitatory ( b = 0, 1) and inhibitory ( b = 2) currents must 
nearly balance. Therefore we call (5) the balance condition. Defining J a b = 
Jabsj Kb/ K q, we can also write it in the form 



2 

/ ] J ab’b’b — 0 . 
b—0 



( 6 ) 
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The external rate ro is assumed known, so these two linear equations can be 
solved for rj,, b = 1,2. We can write the solution as 

2 

^ ^ J ]abJb0^"0, (7) 

6=1 

where by J” 1 we mean the inverse of the 2x2 matrix with elements J a b , a,b = 
1,2. This result was obtained some time ago by Amit and Brunei [25-27] and, 
for a nonspiking neuron model, by van Vreeswijk and Sompolinsky [28,29]. 

However, a complete mean field theory involves the rate fluctuations within 
the populations and the correlation functions, and it is clear that if we want to 
understand something quantitative about the degree of irregularity of the neu- 
ronal firing, it is necessary to do the full theory. This cannot be done analytically, 
so we resort to numerical calculation. 

Numerical Procedure 

Our method was inspired by the work of Eisfeller and Opper [30] on spin glasses. 
They, too, had a mean field problem that could not be solved analytically, so they 
solved numerically the single-spin problem to which mean field theory reduced 
their system. In our case, we have to solve numerically, the problem of a single 
neuron driven by Gaussian random noise, and the crucial part is to make the 
input noise statistics consistent with the output bring statistics. 

This requires an iterative procedure. We have to start with a guess about the 
mean rates, the rate fluctuations, and the correlation functions for the neurons in 
the two populations. We then generate noise according to (2) and simulate many 
trials of neurons driven by realizations of this noise. In these trials, the effective 
numbers of inputs Kb are varied randomly from trial to trial, with a Gaussian 
distribution of width '/Kb, to capture the effects of the random connectivity in 
the network. We compute the bring statistics for these trials and use the result 
to improve our estimate of the input noise statistics. We then repeat the trials 
and iterate the loop until the input and output statistics agree. 

We can get a good initial estimate of the mean rates from the balance condi- 
tion equation (7), but this is harder to do for the rate buctuations and correlation 
function. The method we have used is to do the initial trials with white noise in- 
put (of a strength determined by the mean rates) . There seems to be no problem 
converging to a solution with self consistent rates, rate buctuations and bring 
correlations from this starting point. 

More details of the procedure can be found in [2]. 

Some Results 

As a measure of the bring irregularity, we consider the Fano factor F. It is 
dehned as the ratio of the variance of the spike count to its average, where both 
statistics are computed over a large number of trials. It is easy to relate it to the 
correlation function, as follows. 
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If S(t) is a spike train as in (1), the spike count in an interval from 0 to T is 

d tS{t). (8) 

Its mean is just n = J Q T rdt = rT, and its variance is 

(n - n) 2 = [ T d t [ T dt'([S{t) - r)(S(t') - r}) (9) 

J o Jo 

The quantity in the averaging brackets in (9) is just the correlation function. 
Changing integration variables from (f, t') to (f = ^(t + t’),s = t — t') and taking 
T — > oo gives 

F = 1 + - f dsC(s). (10) 

' J — oo 

For a Poisson process, C(s) = rS(s), leading to F = 1. 

Thus, a Fano factor greater than 1 is not really “more irregular than a Pois- 
son process”, since any deviation of F from 1 comes from some kind of firing 
correlations. 

For this model we have studied how the magnitude of the synaptic strengths 
affects the Fano factor. We have used 




< u) 

In Fig. 1 we plot F as a function of the overall scaling factor J s for three dif- 
ferent values of the relative inhibition strength g. Evidently, increasing synaptic 
strength in either way increases F. 

How can we understand this result? Let us think of the stochastic dynamics 
of the membrane potential u after a spike and reset, as described, for example, 
by a Fokker-Planck equation. Right after reset, the distribution of u is a delta- 
function at the reset level. Then it spreads out diffusively and its center drifts 
toward a quasi-equilibrium level. The speed of the spread and the width of the 
quasi-equilibrium distribution reached after a time ~ r are both proportional to 
the synaptic strength. 

This distribution is only “quasi-equilibrium” because on the somewhat longer 
timescale of the typical interspike interval, significant weight will reach the ab- 
sorbing boundary at threshold. Nevertheless, we can regard it as nearly station- 
ary if the rate is much less than r _1 . 

The center of the quasi-equilibrium distribution has to be at least a standard 
deviation or so below threshold if the neuron is going to fire at a low-to-moderate 
rate. Thus, since this width is proportional to the synaptic strengths, if we fix 
the reset at zero and the threshold at 1 the drift of the distribution after reset 
will be upward for sufficiently weak strengths and downward for strong enough 
ones. Hence, in the weak case, there is a reduced probability of spikes (relative 
to a Poisson process) for times shorter than r, leading to a refractory dip in 
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Fig. 1. Fano factor as a function of the overall synaptic scaling parameter J s , for 3 
values of the relative inhibition parameter g. 

the correlation function and a Fano factor bigger than 1. In the strong-synapse 
case, the rapid initial spread of the membrane potential distribution before it 
has a chance to drift very far downward leads to excess early spikes, a positive 
correlation function at short times, and a Fano factor bigger than 1. The relevant 
ratio is the width of the quasi-equilibrium membrane potential distribution (for 
this model, roughly speaking, J s ) divided by the different between reset and 
threshold. 

The above argument applies even for neurons with white noise input. But 
in the mean field description the firing correlation induced by this effect lead to 
correlations in the input current, which amplify the effects. 

3 Model with Conductance-Based Synapses 

In a second model, we add a touch of realism, replacing the current-based 
synapses by conductance-based ones. Then the postsynaptic potential change 
produced by a presynaptic spike is equal to a strength parameter multiplied by 
the difference between the postsynaptic membrane potential and the reversal 
potential for the class of synapse in question. In addition, we include a simple 
model for synaptic dynamics: we need no longer assume that the postsynaptic 
potential changes instantaneously in response to the postsynaptic spike. 

Now the subthreshold membrane potential dynamics become 

= - g L u»(t ) -££<&(*)«(*) - V b ) (12) 

b=0 j= 1 
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Here gL is a nonspecific leakage conductance (taken in units of inverse time); it 
corresponds to r _1 in (1). The V b are the reversal potentials for the synapses 
from population b ; they are above threshold for excitatory synapses and below 
0 for inhibitory ones, so the synaptic currents are positive (i.e. , inward) and 
negative (outward), respectively, in these cases. The time-dependent synaptic 
conductances g^ b (t) reflect the firing of presynaptic neuron j in population b, 
filtered at its synapse to postsynaptic neuron i in population a: 

&) = 4= f d t'K(t - t')S 3 (t') (13) 

V Afc oo 

when a connection between these neurons is present; otherwise it is zero. (We 
assume the same random connectivity distribution as in the previous model.) 
We have taken the synaptic filter kernel K(t) to have the simple form 

p -t/ T 2 _ p-t/r ! 

m = , (i4) 

r 2 - T\ 

representing an average temporal conductance profile following a presynaptic 
spike, with characteristic opening and closing times T\ and t 2 . This kernel is 
normalized so that f dtK(t) = 1; thus, the total time integral of the conduc- 
tance over the period after an isolated spike is equal to g° b / \ /r K b . Hence, for 
very short synaptic filtering times, this model looks like (1) with a membrane 
potential-dependent </“ fc equal to ga b ( u a(t ) — W) / \fK~b- We take the (dimension- 
less) parameters g® b , like the J ab in the previous model, to be of order 1, so we 
anticipate a large (0(y/ K b )) mean current input from each population b and, 
in the asynchronously-firing steady state, a near cancellation of these separately 
large currents. 

In mean field theory, we have the effective single-neuron equation of motion 

= ~9LU a (t) - ^ 9ab(t)(u a (t) ~ V b ), (15) 

6=0 

in which the total effect of population b on a neuron in population a is a time- 
dependent conductance g a b{t) consisting of a population mean 

(gab) = \/Kbg 0 a b r b , (16) 

static noise of variance 

W = (i - (9°ab) 2 Wtf, (17) 

and dynamic noise with correlation function 

(6g a b(t) Sg ab (t')) = (l - H) (9° ab fC b (t - t'), (18) 

where 

C b (t-t')= j dt\K(t — 1\) f dt 2 K(t' -t 2 )C b (ti,t 2 ) (19) 

J — oo J — OO 

is the correlation function of the synaptically filtered spike trains of population b. 
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The Balance Condition 

As for the model with current-based synapses, we can argue that in an irregu- 
larly, asynchronously-firing state the average du^/dt should vanish. From (15) 
we obtain 

gLUZ + ^2 9abV^ r b(u^ - Vb) = 0. (20) 

b 

Again, for large connectivity the leakage term can be ignored. In contrast to 
what we found in the current-based case, now the balance condition requires 
knowing the mean membrane potential u a- However, we will see that in the 
mean field limit the membrane potential has a narrow distribution centered just 
below threshold. Since the fluctuations are very small, the factor u a — Vb « 9 a — Vb 
in (12) can be regarded as constant, and we are effectively back to the current- 
based model. Thus, defining 

Jaf = 9at(V b - 9 a ), ( 21 ) 

we can just apply the analysis from the current-based case. 



High- Conductance State 

It is useful to measure the membrane potential relative to So, writing u a = 
+ Su a (t) and using the balance condition (20), we find 

i c 

—^2 = —9tot(t)Su a + ^2 S 9ab(t){V b - ( 22 ) 

b 



where 



gtot (t) = gL+ ^2 9ab(t) (23) 

b 

= 9L+^[\2Kb9ab r b + $9ab(t)\, (24) 

b 



with 5 g a b(t) the fluctuating parts of g a b(t), the statistics of which are given 
by (17) and (18). This looks like a simple leaky integrator with current input 
Y^b^ 9 cib{t)(Vb — ujj) and a time-dependent effective membrane time constant 
equal to c/tot (t) 1 - Following Shelley et al. [31], (22) can be further rearranged 
into the form 



dSu a 
d t 



-9tot(t)[5u a - 6V°(t)], 



(25) 



with the “instantaneous reversal potential” (here measured relative to u a ) given 

by 



XV s (+\ — ^b^9ab(t)(Vb U a ) 
aU ~ fftot(i) 



(26) 



Eq. (25) says that at any instant of time, 5u a is approaching < 5V£(t) at a rate 

9tot (t) ■ 
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Fig. 2. The membrane potential u(t) follows the effective reversal potential Vs(t) 
closely, except when V s is above threshold. Here, the threshold is 1 and the reset 



0.94. 



For large Kb, g tot is large (0(y/Kb)), so the effective membrane time constant 
is very small and the membrane potential follows the fluctuating 6V*(t) very 
closely. Fig. 2 shows an example from one of our simulations. This is the main 
qualitative difference between mean field theory for the model with current- 
based synapses and the one with conductance-based ones. It is also the reason 
we introduced synaptic filtering into the present model. In the current-based 
one, the membrane potential filtered the input current with a time constant r 
which we could assume to be long compared with synaptic time constants, so 
we could safely ignore the latter. But here the effective membrane time constant 
becomes shorter than the synaptic filtering times, so we have to retain the kernel 
K(t) (14). Here we have argued this solely from the fact that we are dealing with 
the mean field limit, but Shelley et al. argue that it actually applies to primary 
visual cortex (see also [32]). 

We also observe that in the mean field limit, both the leakage conductance 
and the fluctuation term 5g a b{i) are small in comparision with the mean, so we 
can approximate gtot.it) by a constant: 

9t°t = 'y ' y/ Kbg a b r b- (27) 

b 

Furthermore, the fluctuations 5V* (t) in the instantaneous reversal potential (26) 
are then of order 1 / y/Kb : membrane potential fluctuations can not go far from 
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Ha- But V*(t) must go above threshold frequently enough to produce bring 
at the self-consistent rates. Thus, must lie just a little below threshold, as 
promised above. Hence, at fixed firing rates, the conductance-based problem 
effectively reduces to a current-based one with a very small effective membrane 
time constant r e g = g^ ot and synaptic coupling parameters J®® given by (21). 
Of course, as we increase the firing rates of the external population and thereby 
increase the rates in the network, we will change g to t, making both r e g and the 
fluctuations 6V*(t) correspondingly smaller. 

If we neglected synaptic filtering, the resulting dynamics would be rather 
trivial. It would be self-consistent to take the input current as essentially white 
noise, for then excursions of 5V*(t) above threshold would be be uncorrelated, 
and, since the membrane potential could react instantaneously to follow it up to 
threshold, so would the bring be. (Simulations conbrm this argument.) 

Therefore, the synaptic bitering is essential. It imparts a positive correla- 
tion time to the buctuations 6V£(t), so if it rises above threshold it can stay 
there for a while. During this time, the neuron will bre repeatedly, leading to 
a positive tail in the correlation function for times of the order of the synaptic 
time constants. This broader the kernel K(t — t'), the stronger this effect. In the 
self-consistent description, this effect feeds back on itself: SV*(t) acquires even 
longer correlations, and these lead to even stronger bursty bring correlations. 

Thus, the mean held limit Kb — » oo can be pathological in the conductance- 
based model with synaptic bitering. However, here we take the view that mean 
held theoretical calculations may still give a useful description of real cortical 
dynamics, despite that fact that real cortex is not described by the Kb — * oo 
limit. For example, the true effective membrane time constant is not zero, but, 
according to experiment [32], it is signihcantly reduced from its in vitro value 
by synaptic input, probably [31] to a value less than characteristic synaptic 
bitering times. Doing mean beld theory with moderately, but not extremely 
large connectivities can describe such a state in a natural and transparent way. 

As in the current-based model, the Fano factor grows with the ratio of the 
membrane potential distribution to the threshold reset difference. It also grows 
with increasing synaptic bitering time, as argued above. Fig. 3 shows F plotted 
as a function of 72 , with t\ bxed at 1 ms, for a set of reset values. 



4 Orientation Hypercolumn Model 

Finally, we try to take a step beyond homogeneously random models to describe 
networks with systematic structure in their connections. We consider the ex- 
ample of a hypercolumn in primary visual cortex: a collection of n orientation 
columns, within which each neuron responds most strongly to a stimulus of a par- 
ticular orientation. The hypercolumn contains columns that cover the full range 
of possible stimulus orientations from 0 to n. It is known that columns with simi- 
lar orientation selectivities interact more strongly than those with dissimilar ones 
(because they tend to lie closer to each other on the cortical surface). We build 
this structure into an extended version of our model, which can be treated with 
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Fig. 3. Fano factor as a function of T 2 for 3 reset values. 



essentially the same mean field methods as the simpler, homogeneously random 
one. In the version we present here, we revert to current-based synapses, but it 
is straightforward to construct a corresponding model with conductance-based 
ones. 

A study of a similar model, for non-spiking neurons, was reported by Wolf et 
al. [33] . Those authors have also simulated a network of spiking neurons like the 
one described here [34], complementing the mean-field analysis we give here. 

Each neuron now acquires an extra index 9 labeling the stimulus orientation 
to which it responds most strongly, so the equations of motion for the membrane 
potentials become 



duf 
d t 



a Q 2 n Nb/n 

J i 

6—1 6'=1 j = 1 



a6,b6' nb6 






(28) 



The term /“(0,0o,f) represents the external input current for a stimulus with 
orientation 9q. (In this section we set all thresholds equal to 1, and 9 refers 
only to orientation.) We assume it comes through diluted connections from a 
population of Nq 1 Poisson neurons which fire at a constant rate r o: 

3 



(29) 
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As in the single-column model, we take the nonzero connections to have the 
value 

J-°(Mo) = ^= (30) 

v-Ko 

but now we take the connection probability to depend on the difference between 
9 and 9 0 , according to 

W,0o) = ^[l + ecos2(0-0 o )]. (31) 

tvo 

This tuning is assumed to come from a Hubei- Wiesel feed-forward connectivity 
mechanism. The general form has to be periodic with period n and so would have 
terms proportional to cos 2 m(9 — 9 o) for all m, but here, following Ben-Yishai et 
al. [35], we use the simplest form possible. We have also assumed that the degree 
of tuning, measured by the anisotropy parameter e, is the same for inhibitory 
and excitatory postsynaptic neurons. 

Assuming isotropy, we are free to measure all orientations relative to 9q, so 
we set 9q = 0 from now on. 

Similarly, we take the nonzero intracortical interactions J^’ bS to be 



ij VK~ b 



(32) 



and take the probability of connection to be 

P ab {9-9') = ^[1 + 7 cos 2{9-9 ')\ . (33) 

tv ft 

Analogously to (31), P a b is independent of both the population indices a and b, 
since we always take K^/Nt, independent of b. 

In real cortex, cells are arranged so they generally lie closer to ones of similar 
than to ones of dissimilar orientation preference, and they are more likely to have 
connections with nearby cells than with distant ones. This is the anatomy that 
we model with (33). In [4] and [5] a slightly different model was used, in which the 
connection probability was taken constant but the strength of the connections 
was varied like (33). The equations below for the balance condition and the 
column population rates are the same for both models, but the fluctuations are 
a little different. 

As for the input tuning, the form (33) is just the first two terms in a Fourier 
series, but again we use the simplest possible form for simplicity. 



Balance Condition and Solving for Population Rates 



The balance condition for the hypercolumn model is simply that the net synaptic 
current should vanish for each column 9. Going over to a continuum notation by 
writing the sum on columns as an integral, we get 



V A Ko^ao(l + e cos 2fl)r 0 + y [ — J ab [l + 7 cos 2 ( 6 > - 9')}\/lQ,r b (9') = 0. 

b J ~*/ 2 77 

(34) 
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We have to distinguish two cases: broad and narrow tuning. In the broad case, the 
rates n,(0) are positive for all 9. In the narrowly-tuned case (the physiologically 
realistic one), rb{9) is zero for \9\ greater than some 9 C , which we call the tuning 
width. (In general 9 C could be different for excitatory and inhibitory neurons, 
but with our a-independent e in (31) and a- and 6-independent 7 in (33), it turns 
out not to.) 

In the broad case the integral over 9' can be done trivially with the help of 
the trigonometric identity cos( A — B) = cos A cos B + sin A sin B and expanding 
rb{9') = rb,o + cos 20' + •• •. We find that the higher Fourier components 
fb, 2 rm m > 1, do not enter the result: 

\fK ~0 Jao(l + ecos20)r o + ^ + \l r b ,2 cos 26) = 0. (35) 

b 



If (35) is to hold for every 0 , the constant piece and the part proportional to 
cos 26 both have to vanish: for each Fourier component we have an equation like 
(5). Thus we get a pair of equations like (7): 



2 

^a,0 ^ ^ [J \abJbO^O 

b—1 



2,2 = -— VP p abJbo r 0 (= —Vo) , (36) 

-W £ sy 



6=1 



where J a b = J a b\/Kb/Ko , as in the simple model. 

This solution is acceptable only if e < y/2, since otherwise r a (9) will be 
negative for 9 > \ cos -1 (—r a ,o/r a , 2 ) ■ 

Therefore, for e > 7/ 2, we make the ansatz 

r a($) = 77,2 (cos 2$ — cos 2 9 C ) (37) 

(i.e., we write r a> o as — r a) 2cos20 c ) for \9\ < 9 C and r a {9) = 0 for |0| > 9 C . We 
put this into the balance condition (34). Now the integrals run from —9 C to 9 C , 
so they are as trivial as in the broadly-tuned case, but the ansatz works and we 
find 



Jaoro + ^2 JabnafoiPc) = 0, eJ a0 r 0 + 7^ JabTbahiOc) = 0, (38) 

b 



b 



where 



/o(0c) = ~ (sin 29 c - 29 c cos20 c ), f 2 (9 c ) = ±(0 C - ±sin40 c ), (39) 

(The algebra here is essentially the same as that in a different kind of model 
studied by Ben-Yishai et al. [35]; see also [36].) 

Eqns. (38) can be solved for 9 C and r a> 2, a = 1,2. Dividing one equation by 
the other leads to the following equation for 9 C : 

h(9 c ) e 
fo(9 c ) 7' 



(40) 
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Then one can use either of the pairs of equations (38) to find the remaining 



unknowns r a , 2 '- 




E 0- 1 }abJbO'ro- 

b 



(41) 



The function f2(9 c ) / fo(9 c ) takes the value 1 at 9 C = 0 and falls monotonically 
to \ at 9 C — 7t/ 2. Thus, a solution can be found for i < e/7 < 1. For e/7 — > 

0 q — + 7r/2 and we go back to the broad solution. For e/7 — » 1, 0 C — » 0: the tuning 
of the rates becomes infinitely narrow. Note that stronger tuning of the cortical 
interactions (bigger 7) leads to broader orientation tuning of the cortical rates. 
This possibly surprising result can be understood if one remembers that the 
cortical interactions (which are essentially inhibitory) act divisively (see, e.g., 
(36) and (41)). 

Another feature of the solution is that, from (40), the tuning width does 
not depend on the input rate ro, which we identify with the contrast of the 
stimulus. Thus, in the narrowly-tuned case, the population rates in this model 
automatically exhibit contrast-invariant tuning, in agreement with experimental 
findings [37]. We can see that this result is a direct consequence of the balance 
condition. 



However, we should note that individual neurons in a column will exhibit 
fluctuations around the mean tuning curves which are not negligible, even in 
the mean-field limit. These come from the static part of the fluctuations in the 
input current (like the second term in (2) for the single-column model), which 
originate from the random inlromogeneity of the connectivity in the network. 

As for the single-column model, the full solution, including the determination 
of the rate fluctuations and correlation functions, has to be done numerically. 
This only needs a straightforward extension of the iterative procedure described 
above for the simple model. 



Tuning of Input Noise 

We now consider the tuning of the dynamic input noise. Using the continuum 
notation, we get input and recurrent contributions adding up to 

(8I a (0,t)5I a (0,t')) = Jf 0 ( 1 + ecos2 9)r 0 6(t - t') 

/ 7T inf 

—Jl b { 1 + 7COS2(0 - 9')]C b (9',t-t'), (42) 

-7T K 

where Cb(9' ,t — t') is the correlation function for population b in column 9' . We 
can not proceed further analytically for t t ' , since this correlation function has 
to be determined numerically. But we know that for an irregularly firing state 
Cb(0, t — t') always has a piece proportional to rb(9)6(t — t'). This, together with 
the external input noise, gives a flat contribution to the noise spectrum of 

lim (\5I a (9,u>)\ 2 ) = J^ o (l + ecos20)ro + Y/ f — J^ b [l + ^ cos 2(9-9' ))r b (9'). 

(43) 
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Fig. 4. Fano factor as a function of orientation 9 for 3 values of J 3 . 



The integrals on O' are of the same kind we did in the calculation of the rates 
above, so we get 

lim (\5I a (0,uj)\ 2 ) = J^ 0 (l + ecos20)r 0 + y^ J^ b r ba [fo(O c ) + -/f 2 (0 c ) cos29] 

b 

= r o( l + ecos26>)(J^ 0 - JlbO^bcJco), (44) 

be 

where we have used (40) and (41) to obtain the last line. Thus, the recurrent 
synaptic noise has the same orientation tuning as that from the external input, 
unlike the population firing rates, which are narrowed by the cortical interac- 
tions. 



Output Noise: Tuning of the Fano Factor 

To study the tuning of the noise in the neuronal firing, we have to carry out the 
full numerical mean field computation. Fig. 4 shows results for the tuning of the 
Fano factor with 0 , for three values of the overall synaptic strength factor J s . 
For small J s there is a minimum in F at the optimal orientation (0), while for 
large J s there is a maximum. It seems that for any J s , F is either less than 1 
at all angles or greater than 1 at all angles; we have not found any cases where 
the firing changes from subpoissonian to superpoissonian as the orientation is 
varied. 
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5 Discussion 

These examples show the power of mean field theory in studying the dynamics of 
dense, randomly-connected cortical circuits, in particular, their bring statistics, 
described by the autocorrelation function and quantities derived from it, such 
as the Fano factor. One should be careful to distinguish this kind of mean field 
theory from ones based on “rate models”, where a function giving the bring 
rate as a function of the input current is given by hand as part of the model. 
By construction, those models can not say anything about bring statistics. Here 
we are working at a more microscopic level, and both the equations for the 
bring rates and the bring statistics emerge from a self-consistent calculation. We 
think it is important to do a calculation that can tell us something about bring 
statistics and correlations, since the irregular bring of cortical neurons is a basic 
experimental fact that should be explained, preferably quantitatively, not just 
assumed. 

We were able to see in the simplest model described here how this irregularity 
emerges in mean held theory, provided cortical inhibition is strong enough. This 
conbrms results of [25-27], but extends the description to a fully self-consistent 
one, including correlations. 

It became apparent how the strength of synapses and the post-spike reset 
level controlled the gross characteristics of the bring statistics, as measured by 
the Fano factor. A high reset level and/or strong synapses result in an enhanced 
probability of a spike immediately after reset, leading to a tendency toward 
bursting. Low reset and/or weak synapses have the opposite ebect. 

Visual cortical neurons seem to show typical Fano factors generally some- 
what above the Poisson value of 1. They have also been shown to have very 
high synaptic conductances under visual stimulation. Our mean-beld analysis 
of the model with conductance-based synapses shows how these two observed 
properties may be connected. 

In view of the large variation of Fano factors that there could be, it is perhaps 
remarkable that observed values do not vary more than they do. We like to 
speculate about this as a coding issue: Any constrained bring correlations imply 
reduced freedom to encode input variations, so information transmission capacity 
is maximized when correlations are minimized. Thus, plausibly, evolutionary 
pressure can be expected to keep synaptic strengths in the right range. 

Finally, extending the description from a single “cortical column” to an array 
or orientation columns forming a hypercolumn provided a way of understanding 
the intracortical contribution to orientation tuning, consistent with the basic 
constraints of dominant inhibition, irregular bring and high synaptic conduc- 
tance. 

These issues can also be addressed directly with large-scale simulations, as 
in [31]. However mean held theory can give some clearer insight (into the re- 
sults of such simulations, as well as of experiments), since it reduces the network 
problem to a single-neuron one, which we have a better chance of understand- 
ing. So we think it is worth doing mean held theory even when it becomes as 
computationally demanding as direct network simulation (as it does in the case 




John Hertz, Alexander Lerchner, and Mandana Ahmadi 



of the orientation hypercolumn [34]). At the least, comparison between the two 
approaches can allow one to identify clearly any true correlation effects, which 
are, by definition, not present in mean field theory. 

Much more can be done with mean field theory than we have described here. 
First, as mentioned above, it can be done for any kind of neuron, even a multi- 
compartment one, not just a point integrate-and-fire one. At the level of the 
connectivity model, the simple model described by (33) can also be extended 
to include more details of known functional architecture (orientation pinwheels, 
layered structure, etc.). It is also fairly straightforward to add synaptic dynamics 
(not just the average description of opening and closing of the channels on the 
postsynaptic side described by the kernel I\(t — t') in (13)). One just has to add 
a synaptic model which takes the spikes produced by our single effective neuron 
as input to a synaptic model. Thus, the path toward including more potentially 
relevant biological detail is open, at non-prolribitive computational cost. 
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Abstract. Neuroscience studies how a large collection of coupled neu- 
rons combines external data with internal memories into coherent pat- 
terns of meaning. Such a process is called “feature binding” , insofar as the 
coherent patterns combine together features which are extracted sepa- 
rately by specialized cells, but which do not make sense as isolated items. 
A powerful conjecture, with experimental confirmation, is that feature 
binding implies the mutual synchronization of axonal spike trains in neu- 
rons which can be far away and yet contribute to a well defined perception 
by sharing the same time code. 

Based on recent investigations of homoclinic chaotic systems, and how 
they mutually synchronize, a novel conjecture on the dynamics of the sin- 
gle neuron is formulated. Homoclinic chaos implies the recurrent return 
of the dynamical trajectory to a saddle focus, in whose neighbourhood 
the system susceptibility (response to an external perturbation) is very 
high and hence it is very easy to lock to an external stimulus. Thus ho- 
moclinic chaos appears as the easiest way to encode information by a 
train of equal spikes occurring at erratic times. 

In conventional measurements we read the number indicated by a meter’s 
pointer and assign to the measured object a set position corresponding 
to that number. On the contrary, a time code requires a decision time T 
sufficiently longer than the minimal interspike separation h, so that the 
total number of different set elements is related in some way to the size 
j-. In neuroscience it has been shown that T ~ 200?ns while t\ ~ 3 ms. 
In a sensory layer of the brain neocortex an external stimulus spreads 
over a large assembly of neurons building up a collective state, thus 
synchronization of trains of different individual neurons is the basis of a 
coherent perception. 

The percept space can be given a metric structure by introducing a dis- 
tance measure. This distance is conjugate of the duration time in the 
sense that an uncertainty relation is associated with time limited per- 
ceptions. If the synchronized train is truncated at a time AT < T, then 
the corresponding identification of a percept P carries an uncertainty 
cloud AP. As two uncertainty clouds overlap, interference occurs; this 
is a quantum behavior. Thus the quantum formalism is not exclusively 
limited to microscopic phenomena, but here it is conjectured to be the 
appropriate description of truncated perceptions. 

This quantum feature is not related to Planck’s action but to the details 
of the perceptual chain. 
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1 Homoclinic Chaos and Its Biological Relevance 

Homoclinic chaos (HC) appears as the optimal strategy for a time code shared by 
a large crowd of identical coupled objects. Indeed HC provides at each pseudo- 
period (or Inter Spike Interval = ISI) the alternation of a regular large spike 
(a) and a small chaotic background (b). (b) is the sensitive region where the 
activation from neighbors occurs, while the spike (a) provides a suitable signal 
to activate the coupling. Whence, a chain of weakly coupled objects of this kind 
will easily synchronize, reaching a state common to all sites. 
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Fig. 1. (a) Experimental time series of the laser intensity for a C02 laser with feedback 
in the regime of liomoclinic chaos, (b) Time expansion of a single orbit, (c) Phase space 
trajectory built by an embedding technique with appropriate delays [from [1]]. 



An easy explanation of such a behavior is offered by the experimental results 
presented in Fig. 1, showing the alternation of large spikes and a small chaotic 
background. Rather than the time series (Fig. la), b) ) it is convenient to build 
the three dimensional (3D) phase space trajectory by an embedding technique 
(Fig. lc). For pedagogical reasons, we extract the schematic behavior in Fig. 2. 
In the course of the orbit, the phase point approaches the saddle focus S with 
a contraction rate a and then escapes from S with an expansion rate 7 . In 3D 
the escape rate may be complex 7 ± iui providing oscillations. Whenever 7 > u> 
we have the so called Slrilnikov chaos [3] consisting of almost identical peaks 
P separated by a chaotic background. In that background, the proximity to 
the saddle focus S implies a large susceptibility, that is, a large sensitivity to a 
forcing stimulus, thus making HC prone to lock to external signals. Fig. 2 lists 
some situations in which this locking is useful, they will be discussed in the next 
Section. 
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Fig. 2. Schematic view of the phase space trajectory approaching the saddle S and 
escaping from it. Chaos is due to the shorter or longer permanence around S; from a 
geometrical point of view most of the orbit P provides a regular spike. 



Thus the first key point of this paper is that an array of weakly coupled 
HC systems represents the simplest model for a physical realization of feature 
binding. Various experiments [14, 22] prove that a decision is taken after 200ms 
of exposure to a sensory stimulus. As we elaborate in Sec. 5, synchronizing 
over 200ms trains of spikes with a minimal interspike separation of 3 ms and 
an average repetition of 25 ms (the so called gamma band in the EEG jargon) 
corresponds to a repertoire of about 10 11 different words. T ~ 200ms seems to 
be the time necessary to localize an element in the set of 10 11 virtual sensations 
and whence activate some action. 

How to make use of the synchronization pattern: assume that a sensory layer 
(visual, auditory, etc. ) receives an external stimulus; it is pointless to recur to 
an hypothetical “ homunculus ” sitting within the brain, reading the synchroniza- 
tion state and acting consequently, activating motor actions as escaping from a 
predator, catching a prey, or speaking about [16]. We propose a simple model 
based on available brain components, consisting of coding neurons as described 
above and a reading system which localizes the boundaries of the synchronized 
domains and decides which is the prominent one, based on a “majority rule” . 

The appropriate algorithm for extracting the time code is the Wigner func- 
tion [28] which was introduced in 1932 in quantum mechanics. It compares a 
signal with a shifted version of itself and sums over all shifts with a phase factor 
which yields a Fourier transform. The time code is activated on the largest syn- 
chronized pattern established on the sensory layer; if there are different stimuli 
the time coding must be preceded by a space coding which evaluates which one 
is the winning signal. This space coding is also reasonably performed by a space 
Wigner function, as discussed later below. Based on the above considerations, 
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we introduce the second key point of percept formation, namely, a quantum 
limitation in information encoding/decoding through spike trains, whenever the 
processing session is interrupted. In fact, the temporal coding requires a suffi- 
ciently long sequence of synchronized spikes, in order to realize a specific percept. 
If the sequence is interrupted by the arrival of new uncorrelated stimuli, then 
an uncertainty AP emerges in the percept space P. This is related to the finite 
duration AP allotted for the code processing by the uncertainty relation 

AP ■ AT >C 

where C represents a quantum constraint on the coding. This means that the 
percepts are not set-theoretical objects, that is, objects belonging to separate 
domains, but there are overlap regions where it is impossible to discriminate one 
percept from another. 

In conclusion, we will elaborate the following points: 

1. Chaos, precisely HC, provides an efficient time code for classifying different 
percepts 

2. The code is read via a Wigner distribution; this provides quantum interfer- 
ence between different percepts 

3. The physiological decision time T ~ 200ms probably corresponds to deleting 
the interference; it acts as the decoherence time in quantum mechanics and a 
quantum effect can be seen when the perceptual task is truncated to AT < T. 

2 Synchronization of Homoclinic Chaos 

A wide class of sensory neurons demonstrates lromoclinic chaotic spiking activity 

[ 10 ]- 

Precisely, a saddle focus instability separates in parameter space an excitable 
region, where axons are silent (Fig. 4b), from a periodic region, where the spike 
train is periodic (equal interspike intervals (Fig. 4a)). If a control parameter is 
tuned at the saddle focus, the corresponding dynamical behavior (homoclinic 
chaos) consists of a frequent return to the instability [1]. This manifests as a 
train of geometrically identical spikes, which however occur at erratic times 
(chaotic interspike intervals). Around the saddle focus the system displays a 
large susceptibility to an external stimulus, hence it is easily adjustable and 
prone to respond to an input, provided this is at sufficiently low frequencies; this 
means that such a system is robust against high frequency noise as discussed 
later. 

Such a type of dynamics has been recently dealt with in a series of reports 
that here I recapitulate as the following chain of linked facts. 

1. A single spike in a 3D dynamics corresponds to a quasi-lromoclinic trajectory 
around a saddle focus SF; the trajectory leaves the saddle and returns to 
it (Fig. 1,2). We say “quasi-homoclinic” because, in order to stabilize the 
trajectory away from SF, a second fixed point, namely a saddle node SN, is 
necessary to assure a lreteroclinic connection. 
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Fig. 3. Comparison between HC(homoclinic chaos) (left column) and Lorenz chaos 
(right column). The top row is a phase space projection over two dynamical variables; 
SF denotes the saddle focus and SN the saddle node, on the left the two SF map one 
onto the other after an inversion (x ~l x, y ~l y) around the origin. The intermediate 
row shows the time series for variables xl of HC (it represents the laser intensity in the 
case of the C02 laser) and x of Lorenz; in the former case, a suitable threshold cuts on 
the chaotic background; in the latter case, no convenient region for thresholding can 
be isolated. Bottom row, left : after threshold, the new variable S(t) alternates spikes 
with flat regions where the systems has a high sensitivity and short refractive windows 
where the intensity S(t) goes to zero. 
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Fig. 4. Stepwise increase a) and decrease b) of control parameter Bo by pml brings the 
system from homoclinic to periodic or excitable behavior, c) In case a) the frequency 
r of the spikes increases monotonically with Bq [from [18]]. 
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2. The homoclinic chaos (HC) reported in Fig. 1 and Fig. 2 has a localized 
chaotic tangle surrounded by an island of stability (Fig. 3 -left). This provides 
a sensitivity region corresponding to the chaotic tangle and an active region 
corresponding to the identical spikes. Such a behavior is crucial to couple 
large arrays of HC systems, as it occurs in the neuron coupling in the brain 
[6]. On the contrary, standard chaos as Lorenz where the chaotic behavior 
fills the whole attractor is inconvenient for synchronization purposes (Fig. 3 
- right). 

3. A train of spikes corresponds to the sequential return to, and escape from, 
the SF. A control parameter can be fixed at a value BC for which this return 
is erratic (chaotic interspike interval) even though there is a finite average 
frequency. As the control parameter is set above or below Be, the system 
moves from excitable (single spike triggered by an input signal) to periodic 
(yielding a regular sequence of spikes without need for an input), with a fre- 
quency monotonically increasing with the separation AB from Be (Fig. 4). 
A low frequency modulation of B around Be provides alternation of silent 
intervals with periodic bursts. Such bursting behavior, peculiar of neuron 
on CPG (Central Pattern Generators) as the cardio-respiratory system, has 
been modeled by a laser experiment [18]. 

4. Around SF, any tiny disturbance provides a large response. Thus the ho- 
moclinic spike trains can be synchronized by a periodic sequence of small 
disturbances (Fig. 5). However each disturbance has to be applied for a min- 
imal time, below which it is no longer effective; this means that the system is 
insensitive to broadband noise, which is a random collection of fast positive 
and negative signals [29]. 
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Fig. 5. Experimental time series for different synchronization induced by periodic 
changes of the control parameter, (a) 1:1 locking, (b) 1:2, (c) 1:3, (d) 2:1 [from [1]]. 
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5. The above considerations lay the floor for the use of mutual synchronization 
as the most convenient way to let different neurons respond coherently to 
the same stimulus, organizing as a space pattern. In the case of a single dy- 
namical system, it can be fed back by its own delayed signal (DSS = delayed 
self synchronization). As the delay is long enough the system is decorrelated 
with itself and this is equivalent to feeding an independent system. This pro- 
cess allows to store meaningful sequences of spikes as necessary for a short 
term memory [5] . 

6. A dynamical sketch of a single neuron is presented in Fig. 6 and its HC 
behavior is discussed e.g in [10]. The feature binding conjecture, based on 
synchronization of different neurons exposed to the same image, is shown in 
Fig. 7 [23] and the role of threshold resetting due to past memories (ART= 
Adaptive Resonance Theory [8]) is illustrated in Fig. 8. 

7. In the presence of localized stimuli over a few neurons, the corresponding 
disturbances propagate by inter-neuron coupling (either excitatory or in- 
hibitory); a synchronized pattern is uniquely associated with each stimulus; 
different patterns compete and we conjecture that the resulting sensory re- 
sponse, which then triggers motor actions, corresponds by a majority rule 
to that pattern which has extended over the largest cortical domain. An ex- 
ample is discussed for two inputs to a one-dimensional array of coupled HC 
systems [13]. 
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Fig. 6. Dynamical behavior of a single neuron. The sum of the inputs compares with 
a threshold level; if it is below (a) only a few noisy spikes occur as electrical activity 
on the axon; if it is just above (b) we have a periodic spike train, whose frequency 
increases as the input goes high above threshold (c). 
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Fig. 7. Feature binding: the lady and the cat are respectively represented by the mo- 
saic of empty and filled circles, each one representing the receptive field of a neuron 
group in the visual cortex. Within each circle the processing refers to a specific detail 
(e.g. contour orientation). The relations between details are coded by the temporal 
correlation among neurons, as shown by the same sequences of electrical pulses for 
two filled circles or two empty circles. Neurons referring to the same individual (e.g. 
the cat) have synchronous discharges, whereas their spikes are uncorrelated with those 
referring to another individual (the lady) [from [23]]. 



Fig. 8. ART = Adaptive Resonance Theory. Role of bottom-up stimuli from the early 
visual stages an top-down signals due to expectations formulated by the semantic 
memory. The focal attention assures the matching (resonance) between the two streams 
[from cite J ulesz] . 
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These facts have been established experimentally and confirmed by a conve- 
nient model in the case of a class B laser 1 with a feedback loop which readjusts 
the amount of losses depending on the value of the light intensity output [3] . 

The above listed facts hold in general for any dynamical system which has 
a 3-dimensional sub-manifold separating a region of excitability from a region 
of periodic oscillations: indeed, this separatrix has to be a saddle focus. If a 
dynamical system has N > 3 dimensions, then a local stability analysis shows 
that around a saddle focus the systems behaves locally as 3D, since N-3 variation 
rates are largely negative. 

In particular all biological clocks are of HC typein order to have an adaptive 
period which can be re-adjusted whenever necessary (think e.g. of the cardiac 
pace-maker). 



3 Perceptions and Feature Binding 

Let us return to the visual system; the role of elementary feature detectors has 
been extensively studied in the past decades [9]. By now we know that some 
neurons are specialized in detecting exclusively vertical or horizontal bars, or 
a specific luminance contrast, etc. However the problem arises: how elementary 
detectors contribute to a holistic (Gestalt) perception? A hint is provided by [23]. 
Suppose we are exposed to a visual field containing two separate objects. Both 
objects are made of the same visual elements, horizontal and vertical contour 
bars, different degrees of luminance, etc. What are then the neural correlates of 
the identification of the two objects? We have one million fibers connecting the 
retina to the visual cortex. Each fiber results from the merging of approximately 
100 retinal detectors (rods and cones) and as a result it has its own receptive 
field. Each receptive field isolates a specific detail of an object (e.g. a vertical 
bar). We thus split an image into a mosaic of adjacent receptive fields. 

Now the “feature binding” hypothesis consists of assuming that all the cor- 
tical neurons whose receptive fields are pointing to a specific object synchronize 
the corresponding spikes, and as a consequence the visual cortex organizes into 
separate neuron groups oscillating on two distinct spike trains for the two ob- 
jects. 

Direct experimental evidence of this synchronization is obtained by insertion 
of microelectrodes in the cortical tissue of animals just sensing the single neuron 
(Fig. 6) [23]. Indirect evidence of synchronization has been reached for human 
beings as well, by processing the EEG (electro-enceplralo-gram) data [22]. 

1 I recall the classification widely accepted in laser physics. Class A lasers are ruled 
by a single order parameter, the amplitude of the laser held, which obeys a closed 
dynamical equation; all the other variables having much faster decay rate, thus 
adjusting almost instantly to the local held value. Class B lasers are ruled by two 
order parameters, the laser Held and the material energy storage providing gain; 
the two degrees of freedom having comparable characteristic times and behaving as 
activator and inhibitor in chemical dynamics [2] 
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The advantage of such a temporal coding scheme, as compared to traditional 
rate based codes, which are sensitive to the average pulse rate over a time interval 
and which have been exploited in communication engineering, has been discussed 
in a recent paper [24] . 

Based on the neurodynamical facts reported above, we can understand how 
this occurs [8]. The higher cortical stages where synchronization takes place 
have two inputs. One (bottom-up) comes from the sensory detectors via the 
early stages which classify elementary features. This single input is insufficient, 
because it would provide the same signal for e.g. horizontal bars belonging indif- 
ferently to either one of the two objects. However, as we said already, each neuron 
is a nonlinear system passing close to a saddle point, and the application of a 
suitable perturbation can stretch or shrink the interval of time spent around the 
saddle, and thus lengthen or shorten the interspike interval. The perturbation 
consists of top-down signals corresponding to conjectures made by the semantic 
memory (Fig. 7). 

In other words, the perception process is not like the passive imprinting 
of a camera film, but it is an active process whereby the external stimuli are 
interpreted in terms of past memories. A focal attention mechanism assures that 
a matching is eventually reached. This matching consists of resonant or coherent 
behavior between bottom-up and top-down signals. If matching does not occur, 
different memories are tried, until the matching is realized. In presence of a fully 
new image without memorized correlates, then the brain has to accept the fact 
that it is exposed to a new experience. 

Notice the advantage of this time dependent use of neurons, which become 
available to be active in different perceptions at different times, as compared to 
the computer paradigm of fixed memory elements which store a specific object 
and are not available for others (the so called “grandmother neuron” hypothesis) . 

We have above presented qualitative reasons why the degree of synchroniza- 
tion represents the perceptual salience of an object. Synchronization of neurons 
located even far away from each other yields a space pattern on the sensory 
cortex, which can be as wide as a few square millimeters, involving millions 
of neurons. The winning pattern is determined by dynamic competition (the 
so-called “winner takes all” dynamics). 

This model has an early formulation in ART and has been later substantiated 
by the synchronization mechanisms. Perceptual knowledge appears as a complex 
self-organizing process. 

4 The Role of Duration T in Perception: 

A Quantum Aspect 

How does a synchronized pattern of neuronal action potentials become a relevant 
perception? This is an active area of investigation which may be split into many 
hierarchical levels. Notice however that, due to the above consideration on qualia, 
the experiments suggested in this section hold for purely a-semantic perceptions; 
by this we mean the task of detecting rather elementary features which would 
not trigger our categorization skills. 
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Not only the different receptive fields of the visual system, but also other 
sensory channels as auditory, olfactory, etc. integrate via feature binding into 
a holistic perception. Its meaning is “decided” in the PFC (pre-frontal cortex) 
which is a kind of arrival station from the sensory areas and departure for signals 
going to the motor areas. On the basis of the perceived information, motor 
actions are started, including linguistic utterances [22]. 

Sticking to the neurodynamical level, and leaving to psychophysics the in- 
vestigation of what goes on at higher levels of organization, we stress here a 
fundamental temporal limitation. 

Taking into account that each spike lasts about 1ms, that the minimal inter- 
spike separation is 3 ms, and that the decision time at the PCF level is estimated 
as T ~ 200ms, we can split T into 200/3 = 66.6 bins of 3ms duration, which 
are designated by 1 or 0 depending on whether they have a spike or not. Thus 
the a priori total number of different messages which can be transmitted is 

2 66 ~ 6 • 10 19 

However we must account also for the average rate at which spikes proceed in 
our brain, which is r = 40 Hz (so called 7 band, average I SI = 25 ms). When we 
account for this rate we can evaluate a reduction factor a = ^ = 0.54 where S is 
an entropy [21,25,15], thus there are roughly 2 s ~ 10 11 words with significant 
probability. Even though this number is large, we are still within a finitistic 
realm. Provided we have time enough to ascertain which one of the different 
messages we are dealing with, we can classify it with the accuracy of a digital 
processor, without residual error. 

But suppose we expose the cognitive agent to fast changing scenes, for in- 
stance by presenting in sequence unrelated video frames with a time separation 
less than 200ms. While small gradual changes induce the sense of motion as in 
movies, big differences imply completely different subsequent spike trains. Here 
any spike train gets interrupted after a duration AT less than the canonical T. 
This means that the brain cannot decide among all coded perceptions having 
the same structure up to DT, but different afterwards. 

Whenever we stop the perceptual task at a time AT shorter than the total 
time T, then the bin stretch T — AT (from now on we measure the times in 
bin units) is not explored. This means that all stimuli which provide equal spike 
sequences up to AT, and differ afterwards will cover an uncertainty region AP 
whose size is given by 



AP = 2 aT 2~ aAT = P M e~ aATln2 (1) 

where Pm ~ 10 11 is the maximum perceptual size available with the chosen 
T ~ 66.6 bins per perceptual session and rate r = 40 Hz. Relation (1) is very 
different from the standard uncertainty relation 



AP- AT =C 



(2) 



that we would expect in a word-bin space ruled by Fourier transform relations. 
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Indeed, the trascendental equation (1) is more rapidly converging at short 
and long AT than the hyperbola (2). We fit (1) by (2) in the neighborhood of a 
small uncertainty AP = 10 words, which corresponds to AT = 62 bins. Around 
AT = 62 bins the local uncertainty (2) yields a quantum constant 

C = 10 • 62 = 620 words * bins (3) 

To convert C into Js as Planck’s h, consider that: 

1. 1 bin = 3 ms 

2. in order to jump from an attractor corresponding to one perception to a 
nearby one, a minimal amount of energy is needed; we conjecture that this 
roughly 10 ksT , that is, neither too small as ksT , otherwise jumps would 
occur spontaneously at room temperature, nor too high as leV, otherwise 
our brain would need a large amount of energy per perception. For 10 8 
neurons in the visual cortex this makes 4 • 10 ~ 12 J. Thus we estimate 



C = 6 • 10" 12 Js = 10 22 h 



The Wigner function formalism derives from a Sclrroedinger wavefunction 
treatment for a pure state, and corresponding density matrix for mixed states. 

In the perceptual (P, T) space no Sclrroedinger treatment is available, but 
we can apply a reverse logical path as follows. 

The uncertainty relation AP ■ AT > C forbids a partition of the (P, T ) space 
into sets only if the (P, T) space is non commutative. Thus it must be susceptible 
of a Wigner function treatment and we can consider the contours of Fig. 10 as 
fully equivalent to isolevel cuts of a Wigner function. Hence we can consider 
interference of distinct states as in quantum physics. 

This means that in neurophysics time occurs under two completely different 
meanings, that is, as the ordering parameter to classify the position of successive 
events and as the useful duration of a relevant spike sequence, that is the duration 
of a synchronized train. In the second meaning, time T is a variable conjugate 
to perception P. 

The quantum character has emerged as a necessity from the analysis of an 
interrupted spike train in a perceptual process. It follows that the (P, T) space 
cannot be partitioned into disjoint sets to which a Boolean yes/not relation is 
applicable and hence where ensembles obeying a classical probability can be 
considered. A set-theoretical partition is the condition to apply the Church- 
Turing thesis, which establishes the equivalence between recursive functions on 
a set and operations of a universal computer machine. 

The evidence of interfering perceptions should rule out in principle a finitistic 
character of the perceptual processes. This should be the negative answer to 
the Turing 1950 question whether the mental processes can be simulated by a 
universal computer [26] . 

Among other things, the characterization of the “concept” or “category” as 
the limit of a recursive operation on a sequence of individual related perceptions 
gets rather shaky, since recursive relations imply a set structure. 
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Quantum limitations were also put forward by Penrose [20] but on a com- 
pletely different basis. In his proposal, the quantum character was attributed to 
the physical behavior of the “microtubules” which are microscopic components 
of the neurons playing a central role in the synaptic activity. However, speaking 
of quantum coherence at the h-bar level in biological processes is not plausible, 
if one accounts for the extreme vulnerability of any quantum system due to 
“decoherence” processes, which make quantum superposition effects observable 
only in extremely controlled laboratory situations, and at sub-picosecond time 
ranges, not relevant for synchronization purposes in the 10 — 100ms range. 

Our tenet is that the quantum C — level in a living being emerges from the 
limited time available in order to take vital decisions ;it is logically based on a 
non-commutative set of relevant variables and hence it requires the logical ma- 
chinery built for the h-bar quantum description of the microscopic world where 
non-commutativity emerges from use of variables coming from macroscopic expe- 
rience, as coordinate and momenta, to account for new facts. The recent debate 
[19] has clearly excluded the need for hidden variables and hence the possibility 
of considering the quantum formalism as a projection of a more general classical 
one. Similarly, in our case, the fact that decisions are taken on, say, a 100ms se- 
quence of neuronal spikes, excludes a cognitive world where longer decision times 
(say, 300ms) would disentangle overlapping states. In other words, the universe 
of perceptual 100ms events would be correctly described by a C — quantum for- 
malism, since the 300ms perceptual facts would be considered not accessible in 
principle. 

The qualitative uncertainty considerations have been taken as “quantum” by 
postulating an interference within the overlapping region. A more precise consid- 
eration consists in classifying the spike trains. Precisely, if we have a sequence of 
identical spikes of unit area localized at erratic time positions r; then the whole 
sequence is represented by 



m = E l S{t-n)) (4) 

where {ti} is the set of position of the spikes. A temporal code, based on the 
mutual position of successive spikes, depends on the moments of the interspike 
interval distributions 

(ISI)i = {ti - ti- i} (5) 

Different ISIs encode different sensory information. 

A time ordering within the sequence (3) is established by comparing the 
overlap of two signals as (3) mutually shifted in time. Weighting all shifts with a 
phase factor and summing up, this amounts to constructing a Wigner function 
[28,17] 

/ +oo 

f(t + r/2)f(t — t /2)exp(iu>r)dT ( 6 ) 

-OO 

If now f is the sum of two packets f = fi + ji as in Fig. 9, the frequency-time 
plot displays an intermediate interference. Eq. (5) would provide interference 
whatever is the time separation between f\ and fa. 
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In fact, we know that the decision time T truncates a perceptual task, thus 
we must introduce a cutoff function d(r) ~ exp(—r 2 /T 2 ) which transforms the 
Wigner function as 

/ +oo 

f(t + T/2)f(t - T/2)exp(iuT)g(T)dT (7) 

-OO 

In the quantum jargon, the brain physiology breaks the quantum interference 
for t > T ( decoherence phenomenon ). 

5 Who Reads the Information Encoded 
in a Synchronized Neuron Array: 

The Role of the Wigner Function in Brain Operations 

We have seen that feature binding in perceptual tasks implies the mutual syn- 
chronization of axonal spike trains in neurons which can be even far away and 
yet contribute to a well defined perception by sharing the same pattern of spike 
sequence. 

The synchronization conjecture, formulated by many people as e. g. Von der 
Marlsburg in 1976, was later given direct experimental evidence by inserting 
several micro-electrodes probing each one single neuron in the cortex of cats 
and then studying the temporal correlation in response to specific visual in- 
puts(Singer et al).In the human case, indirect evidence is acquired by exposing a 
subject to transient patterns and reporting the time-frequency plots of the EEG 
signals [22]. Even though the space resolution is poor, phase relations among 
EEG signals coming from different cerebral areas at different times provide an 
indirect evidence of the synchronization mechanism. 

Recently the dynamics of lromoclinic chaos (HC) has been explored with 
reference to some laboratory phenomena observed in lasers and then explored 
in its mathematical aspects, which display strong analogies with the dynamics 
of many biological clocks, in particular that of a model neuron (Hodgkin and 
Huxley: see [10]). HC provides almost equal spikes occurring at variable time 
positions and presents a region of high sensitivity to external stimuli pertur- 
bations arriving within the sensitivity window induce easily a synchronization, 
either to an external stimulus or to each other (mutual synchronization ) in the 
case of an array of coupled HC individuals (from now on called neurons). 

In view of the above facts, we can model the encoding of external information 
on a sensory cortical area (e.g. VI in the visual case) as a particular spike train 
assumed by an input neuron directly exposed to the arriving signal and then 
propagated by coupling through the array. As shown by the series of experiment 
on feature binding [23] , we must transform the local time information provided 
by Eqs (3) and (4) into a spatial information which tells the amount of a cortical 
area which is synchronized. 

If many sites have synchronized by mutual coupling, then the read out prob- 
lem consists in tracking the pattern of values (3), one for each site. Let us take 
for simplicity a continuous site index x. 
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In case of two or more different signals applied at different sites a competition 
starts and we conjecture that the winning information (that is, the one which 
is then channeled to a decision center to act consequentially) corresponds to a 
“majority rule”. Precisely, if the encoding layer is a 1-dimensional chain of N 
coupled sites activated by external stimuli at the two ends (i = 1 and i = IV), the 
majority rule says that the prevailing signal is that which has synchronized more 
sites. 

The crucial question is then :wlro reads that information in order to decide 
upon? We can not recur to some homunculus who reads the synchronization 
state. Indeed, in order to be made of physical components, the homunculus 
itself should have a reading neuron layer as discussed above, followed by some 
interpreter which would be a new homunculus, and so on with a “regressio ad 
infinitum” . 

On the other hand, it is well known that, as we map the interconnections in 
the vision system, VI exits through the Vertical stream and the Dorsal stream 
toward the Infer otemporal Cortex and Parietal Cortex respectively. The two 
streams contain a series of intermediate layers characterized by increasing recep- 
tive fields ; hence they are cascades of layers where each one receives converging 
signals from two or more neurons of the previous layer. Let us take for the time 
being this feed-forward architecture as a network enabled to extract relevant 
information upon which to drive consequent actions. 

We show how this cascade of layers can localize the interface between two 
domains corresponding to different synchronization. It is well known that Hubei 
and Wiesel simple cells with a center-surround configuration perform a first and 
second space derivative. Suppose this operation was done at certain layer. At the 
successive one, as the converging process goes on, two signals will converge on a 
simple cells which then performs a higher order derivative , and so on. This way, 
we build a power series of space derivatives. A translated function as f(x + £) 
is then reconstructed by adding up many layers, as can be checked by a Taylor 
expansion. 

Notice that the alternative of exploring different neighborhoods £ of x by 
varying £ would imply a moving pointer to be set sequentially at different posi- 
tions, and there is nothing like that in our physiology. 

The next step consists in comparing the function f(x + £) with a suitable 
standard, to decide upon its value. Since there are no metrological standards 
embedded in a living brain, such a comparison must be done by comparing f 
with a shifted version of itself, something like the product 

f{x + 0 • f(x- 0 

Such a product can be naturally be performed by converging the two signals 
f(x ± £) onto the same neuron, exploiting the nonlinear (Hebbian )response 
characteristic limited to the lowest quadratic nonlinearity, thus taking the square 
of the sum of the two converging inputs and isolating the double product. 

This operation is completed by summing up the different contributions cor- 
responding to different £, with a kernel which keeps track of the scanning over 
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different £, keeping information on different domain sizes. If this kernel were 
just a constant, then we would retrieve a trivial average which cancels the £ 
information. 

Without loosing in generality, we adopt a Fourier kernel exp(ik^) and hence 
have built the quantity 

/ +oo 

f(x- £)f(x + £)exp(ikt;)d£ (8) 

-oo 

W(x,k) is thus a Fourier transform with respect to £. It is called Wigner-Ville 
function .It contains information on both the space position x around which we 
are exploring the local behavior, as well as the frequency k which is associated 
with the space resolution. 

As well known, it contains the most complete information compatible with 
the Fourier uncertainty 

AxAk = 1 (9) 

Notice that building a joint information on locality (x) and resolution (k) by 
physical measuring operations implies such an intrinsic limitation. 

In summary, it appears that the Wigner function is the best read-out of a 
synchronized layer that can be done by exploiting natural machinery, rather than 
recurring to a homunculus. The local value of the Wigner function represents a 
decision to be sent to motor areas triggering a suitable action. 

Now, let us explore the appropriate function f(x) whose space pattern rep- 
resents the relevant information stored in a sensitive layer. 

The pending problem is how to correlate space and time information. When 
many sites are coupled, there is a fixed time step t* between the occurrence of 
spikes at adjacent sites. Even without a counting for axonal time lags, a delay 
is due to the fact the input form a previous site is sensed only when the driven 
system is within the saddle focus region. This implies a site to site delay, whose 
value depends on whether the coupling is positive or negative. In presence of 
a single input, the overall synchronization takes a time Nt i, N being the site 
number. This correspond to a velocity 

v = (f*) _1 (sites / s) (10) 

This new feature which is peculiar of the Wigner function construction, is what 
in quantum mechanics is called “the Sclrroedinger cat paradox” : if we have estab- 
lished that the measurement consists in building up a Wigner function, then we 
must accept the besides the two patches associated with a live cat and a dead cat, 
in case of justified laboratory uncertainty there must be also an oscillating inter- 
ference(Fig. 9). From 1935 to 1995 it had been considered a pseudo-plrilosophical 
problem; nowadays such extra term has evidence if the superposition refers to 
states of microscopic objects as atoms or photons rather than cats. By the way, 
its existence is crucial for quantum computation. 

The reason why we don’t see interference is “decoherence”. Let us explain 
what we mean. 
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Fig. 9. Wigner distribution of two localized sinusoidal packets shown at the top. The 
oscillating interference is centered at the middle time- frequency location [17]. 



AT 




Fig. 10. Uncertainty areas of two perceptions Pi and P 2 for two different durations 
of the spike trains. In the case of short AT, the overlap region is represented by a 
Wigner function with strong positive and negative oscillations which go as cos(^?-T 
along the T axis; therefore with a frequency given by the ratio of the percept separation 
AP = Pi — Pi to the perceptual “Planck’s constant” C. 



If the system under observation is affected by the environment, then the two 
states of the superposition have a finite lifetime; however, even worse, the inter- 
ference decays on a much shorter time (the smaller, the bigger is the separation 
between the two states): so while the separation is still manageable for the two 
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polarization states of a photon, it becomes too big for the two states of a macro- 
scopic quantum system. Precisely if we call ti the intrinsic decay of each one of 
the two states of the superposition, then the mutual interference in the Wigner 
function decays with the so-called decoherence time 



T~dec — 



D 2 



( 11 ) 



where D 2 is the square of the separation D in phase space between the centers 
of the Wigner functions of the two separate states. Notice that, in microscopic 
physics, D 2 si is measured in units of h. Usually is much bigger than unity for 
a macroscopic system and hence Td ec is so short that any reasonable observation 
time is too long to detect a coherent superposition. Instead, in neurodynamics, 
we perform measurement operations which are limited by the intrinsic sizes of 
space and time resolutions peculiar of brain processes. The associated uncer- 
tainty constant C is such that it is very easy to have a relation as (9) with D 2 /C 
comparable to unity, and hence superposition lifetimes comparable to times of 
standard neural processes. This implies the conjecture that for short times or 
close cortical domains a massive parallelism typical of quantum computation 
is possible. The threshold readjustment due to expectations arising from past 
memory acts as an environmental disturbance, which ortlrogonalizes different 
neural states, destroying parallelism. 
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Abstract. The transitory activity of neuron assemblies has been ob- 
served in various areas of animal and human brains. We here highlight 
some typical transitory dynamics observed in laboratory experiments 
and provide a dynamical systems interpretation of such behaviors. Us- 
ing the information theory of chaos, it is shown that a certain type of 
chaos is capable of dynamically maintaining the input information rather 
than destroying it. Taking account of the fact that the brain works in a 
noisy environment, the hypothesis can be proposed that chaos exhibiting 
noise-induced order is appropriate for the representation of the dynam- 
ics concerned. The transitory dynamics typically observed in the brain 
seems to appear in high-dimensional systems. A new dynamical systems 
interpretation for the cortical dynamics is reviewed, cast in terms of 
high-dimensional transitory dynamics. This interpretation differs from 
the conventional one, which is usually cast in terms of low-dimensional 
attractors. We focus our attention on, in particular, chaotic itinerancy, a 
dynamic concept describing transitory dynamics among “exotic attrac- 
tors” , or “attractor ruins” . We also emphasize the functional significance 
of chaotic itinerancy. 



1 The Background of Lecture I 

Cortical activity appears to be unstable. A single time series of such activity is 
often observed to be “nonstationary” with aperiodic changes between synchro- 
nized and desynchronized states, transitory dynamics between quasi-attractors 
(“attractor ruins” in our words), propagation of activity with various frequencies 
and the appearance and disappearance of phase-related synchronizations. Based 
on model studies, we propose the hypothesis that chaotic itinerancy can be uni- 
versally observed not only in cortical synaptic networks but also in cortical gap 
junction systems, though in the latter it appears in a different way. 



P. Erdi et al. (Eds.): Cortical Dynamics, LNCS 3146, pp. 109—128, 2004. 
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Before proceeding to the main issue, let us raise a naive but fundamental 
question concerning brain function. What is the brain doing? There can, of 
course, be various possible answers, but probably the most fundamental one 
would be that it is interpreting signals, not only those arising from the exter- 
nal world, but also those created within itself. The reason is the following. The 
purpose of cognition may be to discover the nature of objects. The purpose of 
movement may be to influence the rule of behavior, as well as to achieve cogni- 
tive purpose. If complete information is provided in advance of these processes, 
then the brain can uniquely solve the problem concerned and does not need 
interpretation. Usually, however, the information that the brain receive is only 
incomplete one. Therefore, the brain must interpret the information and deter- 
mine its nature. This consideration leads us to the study of hermeneutics of the 
brain [1-6]. 

What kind of level of activity should be considered? The dynamic features 
of the brain might mainly appear in its activity at the mesoscopic level. At the 
microscopic level, that is, at single channel or single neuron levels, the brain 
activity appears to be highly irregular and nonstationary. On the other hand, at 
the macroscopic level, that is, at the level of a functional area or of the whole 
brain, the activity appears to be much less disordered. According to a conven- 
tional theory of phase transitions developed in physics, a macroscopic ordered 
motion can appear as the result of cooperativity of microscopic elements of the 
system concerned. The ordered motion can be described by one or more or- 
der parameters. In equilibrium phase transitions, two thermal equilibrium states 
are interchanged by changes in a control parameter such as temperature, mag- 
netic held, etc. In a neighborhood of the critical point of the transition, complex 
nonequilibrium motions appear. The characteristic scales in space of the sys- 
tem in such a critical regime range from microscopic to macroscopic. Then, the 
time-dependent phenomena can be observed at the mesoscopic level, and such a 
temporal evolution indicates the appearance of ordered motions. 

A similar situation can also occur in far-from-equilibrium systems, where 
in place of thermal equilibrium states various kinds of nonequilibrium station- 
ary states can form the basis of phenomena. The state transitions in far-from- 
equilibrium systems can be described by bifurcations. We here apply this frame- 
work to the transitions that appear during the dynamic brain activity. The term 
“critical point” is replaced by the term “bifurcation point” within this frame- 
work. Dissipative systems are typical far-from-equilibrium systems, where a sta- 
tionary inflow and outflow of energy or matter plays a role in maintaining the sta- 
tionary state. In such a system, order parameters often behave time-dependently. 
Thus, the motions can be described by the time-dependency of quantities such 
as density functions, where a density function is, in general, a function of space, 
time and other quntities such as the membrane potential of neurons and the 
calcium concentrations inside the membrane, namely p(x,s,t) = p(x,(v,c),t). 
Thus, it is called a mesoscopic-level description. A typical equation of motion 
at this level is the Navier-Stokes equation of hydrodynamics. The description of 
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complex spatio-temporal patterns in the brain must be made at the mesoscopic 
level. 

A further crucial problem stems from the fact that the brain works in a 
noisy environment. Its mechanism has not yet been clarified. In this situation, 
we assume that the interplay between noise and dynamical systems provides clue 
for solving the problem of how the brain treats noisy signals. Among others, 
noise-induced order [7], stochastic resonance [8], and chaotic resonance [9] are 
noteworthy. Noise-induced order occurs in a certain class of chaotic dynamical 
systems. At a particular noise level, a transition appears from the chaotic state 
to the ordered state, characterized by the appearance of sharp peak in the power 
spectrum, an abrupt decrease of Kolmogorov-Sinai entropy, and a change of the 
Lyapunov exponent from positive to negative values. The orbit is aggregated 
rather than segregated by the noise. In multi-stable systems, the most stable 
state is usually selected by adding noise, provided that the noise itself does not 
destabilize the stable states. If external noise is superimposed upon a periodic 
force, the state may change to resonate the periodic force. This is termed a 
stochastic resonance. A similar resonance may occur in the chaotic environment, 
where chaos replaces noise. This is called a chaotic resonance. This idea was 
suggested, based on the observations that both chaos and noise are ubiquitous 
in the brain. 

Curious transitory phenomena have been observed in various conditions in 
animal and even human brains. Among others, the typical phenomena appear as 
chaotic transitions between quasi-attractors [10-13], “nonstationary” alteration 
between synchronized and desynchronized states [14], the propagation of wave 
packets of 7- and S- range activity [15,16], and the synchronization of epochs 
with large phase differences that appear irregularly [17]. These curious phenom- 
ena possess the common feature that the autonomic transition between states 
is both chaotic and itinerant. These states can be described by a conventional 
attractor. Actually, there have been a number of interpretation of brain activity 
in terms of the attractor concept. However, a conventional attractor, that is, a 
geometric attractor, is an inappropriate model for the interpretation of these 
transitory dynamics. Because the transitory dynamics appear to be transition 
between states, such a state should be described as unstable. Thus, we must find 
a new model for such a state, in other words, we must introduce a new model for 
the dynamic process such that it allows for both stability and instability. Here, 
instability describes a transition from such a state, and stability describes the 
return to the original state. We must extend this process further to successive 
transitions between multiple states. We have proposed chaotic itinerancy as a 
most appropriate concept for such transitory dynamics [18-20,3,4], 

For further details of the background of this study, in particular, for the sig- 
nificance of the interpretation of brain dynamics in terms of high-dimensional 
dynamical systems and also for the potential role of chaotic itinerancy, the read- 
ers can refer to the recent magnum opus [4] of one of the authors (IT). The 
references on complex systems (see, for example, the reference [22]) are also 
highly recommended. 
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2 Theoretical Basis for the Interpretation 
of Cortical Dynamic Activity 

There seems to be a common mechanism underlying all the dynamic behaviors 
described above. To study such an underlying mechanism, chaotic dynamical 
systems of high dimension and the information theory of chaotic dynamical 
systems are required. 

2.1 Attractors 

We would like to describe the dynamic activity mentioned above in terms of 
the concepts of dynamical systems. Cortical dynamic behavior appears to be 
based on far-from-equilibrium states. To maintain a system in such a state, a 
continuous energy pumping is necessary, because of the energy dissipation that 
is inevitable in all living systems. Thus, it is natural to use the attractor concept. 

Let us, therefore, consider the relationship between a kind of attractor and 
a dynamic state. Stationary or steady states in behavior can be represented by 
a fixed point, namely a point attractor. Furthermore, periodic, quasi-periodic 
and irregular motions can be represented by a limit cycle, a torus and a strange 
attractor, respectively. It should be noted that the term, ‘strange attractor’ was 
proposed [23] as the name for the fourth attractor following a fixed point, a 
limit cycle and a torus, indicating a turbulent state expected to appear after the 
collapse of tori of three and higher dimension by the infinitesimal interactions 
between variables. Hence, rigorously speaking, the use of this concept should 
be restricted to descriptions of attractors that appear after the collapse of tori 
of finite dimension, where such finite dimensionality is precisely the difference 
between this theory and Landau theory of hydrodynamic turbulence. In this 
respect, the strange attractor can be distinguished from ordinary chaotic attrac- 
tors for which different scenarios from the one by successive Hopf bifurcations are 
provided, such as successive period doubling bifurcations or saddle-node bifurca- 
tions. In recent literature, however, the term, ‘strange attractor’ has been used 
to describe all chaotic attractors. In the present article, we follow this conven- 
tional use of the terminology, but have restricted it to relatively low-dimensional 
chaos. 

These four kinds of attractors appear also in lower-dimensional dynamical 
systems, up to three dimensions. At least three dimensions are necessary for the 
appearance of a strange attractor in the case of vector fields, whereas only one 
dimension is enough for the existence of chaos in a discrete map. Thus, three is 
a critical number for the presence of strange attractors. 

Hyperchaos, proposed by Rbssler [24], is defined as possessing plural positive 
Lyapunov exponents, giving rise to the number four as the critical dimensional- 
ity because of the presence of at least two dimensions for instability, at least one 
dimension for dissipation, i.e., for stability, and one orbital dimension. The con- 
cept of hyperchaos may be one of the key concepts for the dynamic description 
of the lrigh-dimensional irregular activity of the brain. It is, however, insuffi- 
cient for the dynamic concepts by which the transitory dynamics is adequately 
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represented (in most cases appearing to be aperiodically itinerant), because lry- 
perchaos does not distinguish between the itinerant transitory dynamics and 
simply chaotic dynamics. Our interest here is a common dynamic representation 
for such transitory dynamics observed in the brain. Thus, a high-dimensional dy- 
namic description is necessary to interpret the transitory and itinerant processes. 
This description may provide a new key to modeling the dynamics of brain ac- 
tivity. We claim that chaotic itinerancy is an adequate dynamic description for 
such processes. 

We can discuss the critical dimensionality of chaotic itinerancy. According 
to Kaneko [25], let us estimate two factors that are supposed to determine the 
dimensionality for the chaotic transition. Let N be the system’s dimension. Let 
us assume that the number of states in each dimension is two, taking into ac- 
count the presence of two stable states separated by a saddle. The number of 
admissible orbits cyclically connecting the subspaces increases in proportion to 
(N — 1)!, whereas the number of states increases in proportion to 2 N . If the 
former number exceeds the latter, then all orbits cannot necessarily be assigned 
to each of the states, hence causing the transitions. In this situation, we expect 
itinerant motions between states. This critical number is six for chaotic itin- 
erancy [25,26]. This critical dimensionality may provide us with the boundary 
between low- and lrigh-dimensional dynamical systems. 

As discussed below, several scenarios describing the appearance of chaotic 
itinerancy can be considered, such as the typical case of the appearance of Milnor 
attractors [29]. A Milnor attractor is, by definition, accompanied with positive 
measure of the orbits attracted to it. It can also be accompanied by the orbits 
repelled from it. The lowest dimension of a Milnor attractor is a fixed point that 
appears in the critical situation of the tangent bifurcation in one-dimensional 
maps or the saddle-node bifurcation in higher dimensions [29,31]. 

2.2 Information Structure and Noise Effect 

So far, we have classified the dynamical systems with the dimensionality of at- 
tractors, and also proposed the use of chaotic itinerancy to give a dynamic 
description for the transitory dynamics observed in the brain. Because the brain 
works in a noisy environment, however, one must consider the effects of noise 
on dynamical systems and the information processing of such noisy dynamical 
systems. 

Other than stochastic resonance, which is now well known as the crucial 
effect of noise with periodic forcing, there can be several effects of noise in the 
presence of chaos. In the case of the presence of a complicated basin structure 
such as seen in, for example, the Kill model of Freeman, the system behaves so 
that the external noise enhances the chaotic response of the olfactory bulb that 
detects an odor [27, 28] . 

Furthermore, the existence of nonuniform chaos can be justified in excitable 
systems such as neural systems and such chaos possesses peculiar characteristics. 
The nonuniformity stems from the nonuniformity of the Markov partition, which 
provides the inherent scale of measure of the Markov states. Thus, this brings 
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about uneven probabilities of the residence time of orbits to Markov states. This 
inherent scale of observation is mismatched with the scale given by external noise, 
for example, the uniform scale when uniform noise is applied. This mismatch 
plays a role in generating ordered motions out of chaos when noise is applied. 
This has been called noise-induced order [7] . 

This property of excitable systems has been claimed to be effective for the 
transmission of input information in chaotic systems [32-34]. For the first time, 
let us review information theory, which clarifies the information structure of low- 
dimensional chaotic systems, and then let us examine the propagation of input 
information in the coupled chaotic systems. Without loss of generality, let us 
consider a differentiable map / defined on the interval /,/:/—>/. We consider 
the information contained in the initial conditions. Because one cannot assign 
the initial conditions with infinite precision, one must consider the distribution 
for the assignment of initial conditions. In the usual computer simulations, this 
assignment may be a uniform distribution, or it may be a Gaussian distribution 
in the case of the usual laboratory experiments. Or, one may consider from the 
beginning, a certain probability distribution as an initial distribution. Thus, we 
consider the information contained in the given distribution p(x) and also its 
evolution according to the evolution of dynamical systems. 

To see this, let us define the Kullback divergence which indicates the relative 
information content of p{x) to qix): 

I ip) = [ dxp(x)log^j^-, (1) 

J qyx) 

where x £ I. The evolution of the distribution is provided by the Frobenius- 
Perrone operator F, which is defined as follows: 



Fp(x) 



E 

f~ 1 (x)=y 



p{y ) 

i mv 



(2) 



where f'(y) indicates the derivative of / with respect to y, and the summation 
is taken over the inverse image of x. 

Let us define the information flow by the difference between the Kullback 
divergence before and after applying the operation. 



AI(p) = I (p)~ I(Fp). 



(3) 



We describe the evolution of information contained in the probability distribu- 
tion relative to the stationary distribution p*(x) of the system. This p*(x) is 
invariant under the operator F, i.e. , Fp*(x) = p*(x). The following equation 
holds if p*(x) is absolutely continuous with respect to the Lebesgue measure, 
that is, if p*(x) — > 0 as the Lebesgue measure m( x) goes to zero. 

Alfp) = J dxp* (x)log\df(x)/dx\. (4) 

The right hand side of this equation is just the Lyapunov exponent. There- 
fore, the information flow is provided by the Lyapunov exponent. Oono [35] and 
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Shaw [36] gave essentially the same treatment. The above formulae, eq. (3) and 
eq. (4), show that the initial slope of the decay of information with respect to 
the evolution of the distribution by F provides the Lyapunov exponent. 

It is, however, not sufficient to reveal the details of the information structure. 
In chaotic systems, a local divergence rate fluctuates according to the distribution 
of derivatives (eigenvalues of Jacobian matrix in high-dimensional maps and 
also in vector fields), which gives rise to the fluctuations in information flow. 
Furthermore, our observation is limited to not only the highest digit, but also 
the lowest one, so that the observation is restricted to a finite window. By the 
Lyapunov exponent, only the information lost at (or flowing out of) the highest 
digit is calculated. We require another quantity by which all the information 
flowing out of the observation window is calculated. Such a quantity is mutual 
information. 

p(j)logp(j) 1 - p{i)p{j/i)logp{j/i) \ (5) 

3 * 3 

where p(j/i) is a transition probability from state i to state j. The mutual 
information is information shared between two states, thus one can calculate the 
information transmitted from one state to other states, and consequently one 
can obtain the detailed structure of the fluctuations of information flow. 

By introducing a time-dependency into the definition of mutual information, 
we can further investigate information mixing. 

I (n) (i-,j) = 'EmwV)- 1 - ^^P{i)P {n \j/i) l °9P {n) {3/iY 1 , (6) 

3 i 3 

where n is a time and p ^ ( j /i) the transition probability from i to j after n time- 
steps. There are typically two cases: (1) a linear decay of the mutual information 
in time and (2) an exponential decay, described as follows. 

(1) I(t) = 1(0) - at. 

Taking the derivative with respect to t on both sides, we obtain dl(t)/dt = — a = 
const. This implies that the same quantity of information decays in each time. 
This situation is the case that the fluctuation of information is very small, so that 
the information structure is adequately described by the Lyapunov exponent. 

(2) I(t) = l(0)exp(—rt) 

Taking the derivative with respect to t on both sides, we obtain dl(t)/dt = 
— rl(t ), thus ( dl/dt)/l = — r = const. This implies that the information quantity 
decays by the same ratio in each time. In this second situation, the fluctuation 
of information is large enough to produce the mixing property of information. 
The mixing property here means that each digit of the variable of dynamical 
system containes the information of any other digits. This is guaranteed by the 
calculations of bit-wise mutual information, because such calculations provide 
the shared information between any two digits. 
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The difference of properties between case (1) and case (2) demonstrates a 
crucial difference in the network properties of coupled chaotic systems, relating 
to the way input information is propagated and the dynamic maintenance of such 
information. The input information to such a network is dynamically maintained 
and is transmitted from one chaotic individual system to the others if each 
chaotic individual system possesses the large fluctuations of information flow as 
in case (2). Propagation of the input information can also be extended to other 
systems placed far from the input. On the other hand, if the network consists 
of chaotic systems that possess small fluctuations of information flow, as in case 
(1), then the network cannot have such properties. 

One tends to think that chaos is useless for information processing, because 
input information appears to be destroyed by the orbital instability of chaos. 
The orbital instability rather brings about a role as information source [35,36], 
because symbol sequences are produced by such orbital instability, provided that 
symbols and decision point (s) that determine the symbols are given. The evolu- 
tion rule of states as a dynamic grammar determines a structure of admissible 
symbol sequences [37]. If the property of information mixing holds in each in- 
dividual system, the input information can be dynamically maintained in its 
networks, in spite of the orbital instability of chaos, as mentioned above. Thus, 
chaotic networks of this kind can be considered to be an information channel. 

The studies of chaos from the viewpoint of information processing have pro- 
duced various ideas concerning the possibilities of the application of chaos to 
studies of the function of the brain and also to studies of the structure of mind. 
John Nicolis and others [38-41] proposed a great variety of discrimination of 
inputs by means of a chaotic system with multiple basins, especially in relation 
to a thalamo-cortical pathway for sensory information. Furthermore, the capac- 
ity of short-term memory (working memory), namely the magic number seven 
plus or minus two that G. Miller [42] found was estimated by means of chaotic 
dynamical systems. Recently, Kaneko has proposed another mechanism to gener- 
ate this magic number with globally coupled map systems [25], which is related 
to the critical number, six for the appearance of chaotic itinerancy. Through 
the intensive and also extensive studies of animal olfaction, Freeman and his 
colleagues have clarified the dynamic mechanism of perception with chaos and 
chaotic itinerancy [11-13]. Taking into account the fact that the chaos produced 
in an excitable system (such as neurons and neuron assemblies) possesses the 
mixing property of information, it turns out that further large-scale networks can 
maintain information in a dynamic manner because of the presence of chaotic 
behavior. It should be noted that chaotic itinerancy also has the property of 
information mixing. 



3 Chaotic Itinerancy 

In this section, we review the characteristics of chaotic itinerancy (Cl) and also 
discuss possible mechanisms for the genesis of chaotic itinerancy. 
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3.1 Dynamic Features of Chaotic Itinerancy 

To show the dynamic features of Cl, we here highlight two systems, both of which 
offer opportunities to study Cl, together with another typical system [18] : one is a 
nonequilibrium neural network model for successive association of memories [43, 
3] and the other is a globally coupled chaotic map [19]. In particular, it seems 
that these models show differences in mechanisms for the genesis of CL 

The neocortical architecture of neural networks has been studied in details. 
There is a common structure in different areas, while a specific one can be 
distinguished in each area. Although the relationship between structure and 
function is clearly one of the main problems in biological evolution and remains 
an open question, it seems likely that there is a strong correlation between them. 
From this point of view, we assume that a common functional feature for any 
information processing is manifested in the common network structure in all 
areas. We considered a dynamic phase of successive association of memories 
to represent such a functional feature. We adopted the network architecture 
described by Szentagothai [44-46] for our model. 

The fundamental architecture of the model consists of the recurrent connec- 
tions of the pyramidal cells and the feedback connections of interneurons, which 
provide randomly fixed positive or negative effects on the membrane potential 
depolarization of pyramidal cells. The effect of interneurons is provided by a 
model of the action of the spiny stellate cells and the basket cells. Such an ar- 
chitecture has been considered a fundamental modular element. The module in 
the model consists of two sub-modules and specific inhibitory neurons. The sub- 
modules are coupled to each other through weak connections from the pyramidal 
cells. Specific inhibitory neurons, which are considered to model the Martinocci 
cells or the double bouquet cells, make synaptic contact with the dendrites of 
the pyramidal cells in only one submodule. 

The dynamics of the model were provided as a stochastic renewal with two 
types of neural dynamics: one is the stationary dynamics that maintain the 
neural state (to the previous state), and the other is the usual deterministic 
neural dynamics such as the integrate-and-fire model, though we used its dis- 
crete version in time. This stochastic dynamics was assumed to stem from the 
stochastic release of synaptic vesicles under the influence of inputs. Thus, this 
dynamics provides synaptic noise. We also introduced an additive noise in the 
model equation, considering function of dendritic noise. 

The model simulations exhibited various crucial dynamic features that have 
been claimed to mimic fundamental functions associated with memory dynam- 
ics, such as successive transitions between stored memories, the reorganization 
of successive transitions after learning of new memories, the enhancement of 
memory capacity and the enhancement of accessibility to memory. The dynamic 
rule for the transitions, in particular, was chaotic, but critical. The skeleton of 
the rule can be described by a one-dimensional circle map, /:/—>/, f(x) = 
x + asin( 4irx) + c(modl),x £ I = [0,1], at the critical phase such that the 
map makes a tangential contact with the diagonal line indicating the identity 
f(x) = x, where x £ I. For example, this occurs for a — 0.08 when c = 0.1. 
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This critical situation often appears at the critical phase of intermittency that 
is realized just at the onset of tangent bifurcation in one-dimensional map, or 
saddle-node bifurcations in higher dimensional systems. This, however, does not 
hold in the present case. The neural network model we used is controlled by many 
parameters including various fixed coupling strengths and an updating rate of 
learning. The similar transitions that are characterized by dynamics possessing 
the same criticality as the case mentioned above are observed as changing the 
values of such parameters as long as the chaotic transition occurs. Therefore, 
we concluded [21] that this critical situation arises not from the bifurcation, but 
from the self- organized criticality [47]. We further claim that this critical situa- 
tion is structurally stable because it remains even after changing the bifurcation 
parameters. 

If cortical dynamics follow only deterministic dynamics such as the critical 
circle map, no transitions from such critical fixed points can occur because the 
critical fixed point x* derived from x = f(x) and f'(x*) = 1 is a typical Milnor 
attractor [29] , which is reached from a set of initial points with positive measure 
(but possessing unstable direction). The critical fixed points obtained represent 
stored memories or learned patterns, so no transitions between memories can 
occur in this situation. A little additive noise plays a role in triggering such 
transitions. Such a role for noise can be justified as a mechanism of Cl. This 
issue will be discussed below. 

Cl can be characterized by several indices. Auto- and cross-correlations de- 
cay slowly, following the power law. Related to these quantities, time-dependent- 
mutual information follows an exponential decay or a power decay. Taking ac- 
count of the dynamic storage of information in the network of nonuniform chaos, 
a macroscopic network consisting of the elemental networks that produce Cl 
may retain input information. An iV-dimensional dynamical system possesses 
an TV-tuple of Lyapunov exponents, the ‘Lyapunov spectrum’, each of which in- 
dicates a long-time average of the eigenvalues of the products of the Jacobian 
matrices, where each Jacobian matrix describes a time evolution of the devi- 
ation of trajectories from a basal trajectory in a linear range, associated with 
the orthonormalization of state vectors. Thus, the indication of chaos, that is, 
the presence of orbital instability, is the existence of at least one positive Lya- 
punov exponent. In the presence of Cl, the Lyapunov spectrum tends to include 
many near-zero exponents. Recently, two important reports were published on 
the Lyapunov exponents of CL Sauer claims that the appearance of sustained 
fluctuations of the zero-Lyapunov exponent is a characteristic of Cl [48]. Tsuda 
and Umemura claim that a further characteristic of Cl is that even the largest 
Lyapunov exponent converges in an extremely slow way, associated with large 
fluctuations [31]. It should be noted that Cl differ from simple chaotic transitions 
in the sense that in Cl, stagnant motions at attractor ruins or near Milnor attrac- 
tors appear. This is characterized by a probability distribution of residence time 
at attractor ruins: a power decay in the case of Cl in the nonequilibrium neural 
network [21]. To explain these characteristics of Cl, we considered the chaotic 
transitions between attractor ruins that are no longer attractors but can attract 
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some orbits. The closest concept to attractor ruin is an attractor in Milnor’s 
sense. For this reason, we have been interested in a Milnor attractor. 

Before defining Milnor attractors, let us remember the conventional definition 
of an attractor, namely a geometric attractor. Let M be a compact smooth 
manifold. Let f : M —> M be a continuous map. We denote a geometric attractor 
by A. A subset N of M satisfying f(N) C inter(N) is called a trapping region, 
where inter(N) indicates an interior of N. The equation A = f'ln'Lo f^ n \N) 
defines an attracting set. A geometric attractor is defined as a minimal attracting 
set, that is, it is an attracting set satisfying a topological transitivity. Thus any 
two points in a neighborhood of an attractor do not move far away from each 
other. 

Milnor extended the definition of attractor [29, 49]. Let us define the attractor 
in Milnor’s sense. Let B be a Milnor attractor. Let p(B) be a basin of B , which 
is defined as p{B ) = {x\lj(x) = B,x € M}. Here lu( x) is an cu-limit set of x. An 
w-limit set is a set of w-limit points, where an w-limit point of x for / is a point 
to which x converges under f( nk \x) as k goes to infinity, given the existence 
of a sequence {n^} such that rife goes to infinity as k goes to infinity. A Milnor 
attractor is defined as a set B satisfying the following two conditions. 

1. p(p(B)) > 0, where p is a measure that is equivalent to Lebesgue measure. 

2. There is no proper closed subset B' C B such that p(p(B)\p(B')) = 0. 

A Milnor attractor can be connected to unstable orbits that are repelled 
from the attractor. This situation in Milnor attractor differs from geometric 
attractors. A Milnor attractor may thus provide a mechanism for allowing both 
transitions from a state and returns to the state, which forms elemental behavior 
of what we have described as CI. This is the reason why we are interested in the 
study of this type of attractor. 

3.2 Possible Mechanisms of Chaotic Itinerancy 

Let us discuss possible mechanisms of CI. It will be fruitful to discuss those with 
a kind of symmetry. 

(1) The case of symmetric systems 

(a) 3-tuple ( Chaotic invariant set, Milnor attractors, riddled basins) 

Let / be a differentiable map, which acts on the phase space M, f : M — > M. 
Let q be a certain group action which acts on M onto itself, q : M — > M. 
Now we assume that the dynamical system commutes with the group action, 
that is, fq = qf. Let S(q) be the invariant set under the action q, that is, 
S(q) = {x\ qx = x,x £ M}. Now take x from such an invariant set S(q). By 
qx = x, f(qx) = fix) holds. By the commutation assumption, q(f x) = fiqx), 
and hence q(fx) = fix). The last equation means that f(x) is also invariant, 
that is, f(x) € Siq). In other words, Siq) is also an invariant set under the 
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dynamics /. It is useful to find an invariant set for the dynamics, becasue while 
it is not necessary to find directly an invariant set under the dynamics, it is 
sufficient to simply find invariant set under a group action. The latter is much 
easier to perform than the former. 

Let us consider a globally coupled chaotic map as a simple but typical ex- 
ample of this kind of symmetric system. 

X n+1 = (! - t)f( x( n) + J^—-[ Y /( 4 J> ). ( 7 ) 

where n is a discrete time step, i a discrete space indicating the position of each 
individual map on the space, N the total number of maps, and each map / is 
defined on the interval I. As a typical example, a chaotic logistic map is used, 
f{x) = ax{ 1 — x), where a is fixed to produce chaos. All individual dynamics are 
assumed to be identical. Let us take a permutation of maps on a discrete space 
as a group action q. It is easily verified that the commutation assumption holds. 
For, evolving the dynamics / for some fixed time T after a certain permutation 
q of the maps on a discrete space is equivalent to permuting the maps by q after 
evolving the dynamics / for T-time steps. 

The simplest invariant set in this model is the synchronized state of all ele- 
ments, z n . Inserting z n into all Xn\ be., x $ = z n in the above model equation 
(7), we obtain z n + 1 = f(z n ), which is, by definition, chaotic. Hence, we have 
a chaotic invariant set. There are many other synchronized states which are 
more complex than this basic synchronization, which was defined as partially 
synchronized state [19]. Partially synchronized state indicates, for instance, a 
combination of synchronized Ni oscillators and another synchronized N — Ni 
oscillators, where N is the system size. These partially synchronized states can 
be an attractor. In this situation, if the Lyapunov exponent in the transversal 
direction to the chaotic invariant set is positive, the basins of such attractors 
have a topologically simple structure. However, if the transversal Lyapunov ex- 
ponent becomes negative via a blowout bifurcation, then the flow toward the 
chaotic invariant set must exist with positive measure, on the one hand, and 
on the other hand, the flow toward attractors retains its positive measure. This 
situation produces a riddled basin structure and the appearance of curious tran- 
sitory dynamics between the attractors with intervening chaotic trajectories. It 
should be noted that the attractors and also the chaotic invariant set are Milnor 
attractors. In this respect, one of the mechanisms of Cl in the symmetric case is 
provided by the presence of the 3-tuple of chaotic invariant set, Milnor attrac- 
tors and riddled basins, when an invariant set produced by symmetry is chaotic. 
However, the situation that chaotic invariant set cannot form the basis of the 
transition may appear. In such a case, it does not seem to be easy to permit 
chaotic orbits to appear between Milnor attractors. Furthermore, there cannot 
be a distribution of finite-time Lyapunov exponent in a transversal direction to 
a fixed point or a periodic point, whereas it is possible for a chaotic invariant 
set [30] and perhaps also for a torus, to exist. 
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Actually, we recently confirmed this matter by constructing a dynamic 
model [ 31 ]. The elementary dynamical system was made to possess a Milnor 
attractor of the fixed point as a basic invariant set. The iV-coupled system with 
nearest neighbor interaction on a circle was considered. The transition of the 
Cl did not occur from the point Milnor attractors, but occurred as a chaotic 
transition between tori. Here, a torus is produced by the interaction of point 
Milnor attractors. 

(b) The possibility of the connection between homoclinic chaos and Milnor attrac- 
tors 

Under a similar symmetry, the saddle connections that are structurally unstable 
in conventional dynamical systems can be structurally stabilized as Gucken- 
heimer and Holmes [ 50 ] have proved. For simplicity, we consider the case of a 
saddle connection between two saddles. Let us call them ,S\ and S'2. The con- 
dition of stabilization for the connection from Si to S2 is that the sum of the 
dimensions of the unstable manifold of Si and the stable manifold of S2 ex- 
ceeds the dimension of the space. The symmetric dynamical system consists of 
invariant subspaces. In each subspace, we can confirm this condition. 

However, with only this kind of stabilization mechanism, chaotic transitions 
cannot be proved. What is a mechanism for allowing chaotic transitions, based 
on the saddle connections? Suppose S2 changes its stability in unstable direc- 
tions to produce neutral stability. In other words, the saddle S'2 is supposed to 
change to become a Milnor attractor, S m 2. In these neutral directions, the orbits 
move away from such a Milnor attractor. Those orbits may make a heteroclinic 
orbit connecting to Si, but this should be structurally unstable for the following 
reason. The fact that the sum of the dimension n“ of the unstable manifold of 
Si and the dimension of the stable manifold of S m 2 exceeds the space di- 
mension N, that is, n“ + n\ > N, indicates (N — n“) + ( N — n|) < N. The 
last inequality means that the sum of the dimensions of the stable manifold of 
Si and of the unstable manifold of S m 2 cannot exceed the space dimension. In 
other words, the intersection of the two manifolds makes only a measure-zero 
set in the phase space. Thus, heteroclinic chaos must be expected in a neigh- 
borhood of Si, as in the Shilnikov type of bifurcation. Chaotic itinerancy may 
occur among the Milnor attractors via chaotic orbits if more saddles exist origi- 
naly. This scenario might explain the chaotic switch between synchronized and 
desynchronized states observed in several networks of class I neurons, which will 
be discussed in the last section. 

( 2 ) The case of asymmetric systems 

What can we state in the case without symmetry? If we start with the sym- 
metric systems and then perturb them to lose that symmetry, then we will find 
a quite similar situation to the symmetric case. Actually, asymmetric connec- 
tions in a GCM, which is obtained by the perturbation of a conventional GCM, 
produce similar characteristics to Cl in the symmetric case. The nonequilibrium 
neural network model for successive association of memories, which is one of the 
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pioneering models of Cl, however, lacks the smooth continuation of the sym- 
metric systems. Although the chaotic transition between fixed point attractors 
in Milnor’s sense requires a little noise, the role of noise in this case seems to 
be different from the symmetric case, because we did not find a riddled basin 
structure for each memory state. Hence, the mechanism of Cl in the asymmetric 
case of this type might be different from the symmetric case. No clear theory has, 
however, so far been proposed, although several concepts have been investigated 
to discover its mechanism. At least three cases can be considered. 

(a) Are saddle connections possible? 

Because symmetry is lost, the space to be considered is a whole phase space, not 
an invariant subspace. Now consider two saddles connecting with each other. As 
discussed above, when the saddle connection cannot be stabilized, chaos appears. 
This consequence can be extended to the cyclic connections of m saddles, that 
is, such connections are not structurally stable, and hence chaos will ensure. 
However, this situation seems to lose stagnant motions such that the orbits stay 
at attractor ruins or near Milnor attractors over a long time. 

(b) Genesis of heteroclinic tangency within a chaotic invariant set 
Heteroclinic tangency may bring about a neutral situation, and hence its ap- 
pearance may give rise to the degeneracy of zero-Lyapunov exponents if such 
tangency becomes dominant, compared with transversality. Stagnant motions 
are expected in a neighborhood of such tangency. This case must be studied in 
details. 

(c ) Milnor attractors associated with fractal basin boundaries 

This is a highly hypothetical situation. It is known that fractal basin boundaries 
separates multiple attractors [51]. With respect to Cl, Feudel et al. found a CI- 
like phenomenon in the double rotor system with small amplitude noise [51]. 
In this system, many periodic orbits coexist, with the higher periodic orbits 
possessing very tiny basins which disappear under the influence of noise, leaving 
only the low periodic orbits. A similar situation to this was found in the Kill 
model by Kozma and Freeman [28], where because of fractal basin boundaries, 
long chaotic transients appear before the system falls into a periodic orbit. Orbits 
are trapped for some time in the vicinity of periodic attractors, but eventually are 
kicked by noise into the fractal boundary region, where the orbits become chaotic 
again, and consequently repeat the transitions between chaotic and periodic 
attractors. This is a noise-induced Cl-like phenomenon. We must consider for 
the deterministic case: how could such attractors become a Milnor attractor? 
This is still an open question. 

4 A Neuron- Assembly Model for Transitory Dynamics 
of Synchronization 

In relation to neurocomputing, Cl was found in many neural systems such as 
nonequilibrium neural networks for successive recall of memories, the neural 
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network model for regenerating episodic memory [52,53], the dynamic associa- 
tive memory model [43,21,54], the modified Hopfield model for the travelling 
salesman problem [55] , and the partially connected network for associative mem- 
ory [56] . The concept of Cl has been adopted to provide a dynamic interpretation 
for the processes of perception and identification [11-13, 16, 28]. These processes 
in animal and human behavior are often observed, accompanied by gamma-range 
oscillations. 

On the other hand, model studies ou the dynamics of the class I neuron and 
its network have recently been highlighted, in accordance with the accumulated 
experimental data on various ion-channels. Furthermore, experimental evidence 
has also accumulated on the ubiquity of gap junction couplings in a large number 
of invertebrate brains and also in the mammalian neocortices. The origin of 
gamma-range activity in neuron assemblies has also focused, in particular, on 
the discovery of gap junction networks and their chemical-synaptic interactions 
with principal cells like pyramidal cells. 

An irregular change between synchronization and desynchronization was also 
studied by using the Morris-Lecar model [57]. Whether or not this irregularity 
stems from chaos has not been studied. Moreover, it is known that this irregu- 
lar alteration of synchronized and desynchronized states only occurs in a very 
narrow parameter region. An irregular and seemingly nonstationary alteration 
between synchronized and desynchronized states has actually been observed in 
animal experiments [12-14,17], although the observed activity level differs in 
each experiment. This level difference may represent the difference of meaning 
of synchronization. 

Physiologically, the functions of class II neurons are characterized by N el- 
and AT-ion channels such as those described by the Hodgkin-Huxley equations, 
whereas the function of class I neurons is further determined, typically by the 
transient, slowly inactivating A- channels in addition to the Na- and K- chan- 
nels. From a mathematical point of view, this functional difference can be re- 
duced to aucl represented by the different type of bifurcations. 

The phase space of a class II neuron, which is defined by the voltage of 
membrane potential, and the levels of activity and inactivity in ion channels, is 
characterized by a distorted vector field, thereby the large excursions of electric 
activity indicating the production of a transient spike. The repetitive firings 
of spikes are represented by a sustained oscillation of a limit cycle oscillator, 
which is produced via Hopf bifurcation. The limit cycle starts as an infinitesimal 
amplitude in the case of supercritical Hopf bifurcation, while on the other hand, 
in the case of subcritical Hopf bifurcation, it starts from a finite amplitude. In 
both cases, it preserves almost the same frequency over a relatively wide range 
of bifurcation parameters, such as the input current. For a large deviation of the 
bifurcation parameter from the bifurcation point, the frequency simply depends 
linearly on the amplitude of the deviation. 

The class I neuron is characterized by the interplay of a saddle-node bifurca- 
tion with a Hopf bifurcation. Because of the presence of a saddle-node bifurca- 
tion, the frequency of the limit cycle oscillator produced by the Hop bifurcation 
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is changed markedly by a slight change of the bifurcation parameter, such as 
the input current. In other words, the dynamic range of the response becomes 
extremely large because of the presence of the saddle. Various mathematical 
treatments of the similarities and differences of both class I and II neurons and 
of their models have been recently systematically reviewed [58]. 

We summarize below the chaotic behaviors obtained in published mathemat- 
ical models of networks of class I neurons with the gap junction. Here the gap 
junction couplings can be modeled by diffusion type couplings. 

(1) Nearest neighbor couplings with gap junctions of Connor-type neurons: 
there are no reports relating to chaotic behavior. 

(2) All-to-all couplings with gap junctions of Morris-Lecar-type neurons: 

the chaotic alteration of synchronized and desynchronized states over a very 
narrow parameter region was observed [57]. 

Recently, we investigated [61] the dynamic behavior of the network consist- 
ing of the nearest neighbor couplings with gap junctions of Morris-Lecar-type 
neurons. We observed chaos over a very narrow parameter region. There have 
not been any observations so far of chaotic alteration between synchronized and 
desynchronized states. 

We have, then, proposed a model of the cell assembly of a subclass, class I*, 
of class I neurons with gap junction- type of couplings, as deduced from the Rose- 
Hindmarsh model [59] . It was found that the spatio-temporal chaotic behavior is 
a typical dynamic behavior in the gap junction-coupled class I neurons, whereas 
pulse propagation as well as spiral wave propagation is typical of gap junction- 
coupled class II neurons [60,61]. More details can be found in the article [62]. 
We have also proposed a simple neuron model with two variables, designed to 
possess both bifurcations mentioned above. This model, called the fx- model, is 
similar to the reduced model of Hindmarsh and Rose [63] . We found a chaotic 
transition between synchronized and desynchronized states in the gap junction- 
coupled fx - model that has similar symmetry to the one mentioned above, that 
is, the whole synchronized state constitutes an invariant subspace. Based on 
numerical studies, we have further proposed a hypothesis that this transition 
can be described as Cl. We are now investigating the mathematical mechanism 
of this chaotic transition, which will be published elsewhere. 

5 Summary and Discussion 

We have reviewed the concept of chaotic itinerancy and several models of chaotic 
itinerancy including the models of neural systems. We have also reviewed the 
possibility of information processing with chaos and chaotic itinerancy, based on 
the information theory of chaos. We have further discussed possible mechanisms 
of chaotic itinerancy, although there are still several ambiguities that require 
resolution. 
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We have finally introduced the most recent study on chaotic itinerancy. This 
study investigates building a new framework for the mathematical modeling of 
neural activity, based on the experimental findings of functional organization at 
the mesoscopic activity level of cell assemblies. We found that a simple model 
neural system consisting of class I* neurons with gap junction-type couplings 
has transitory dynamics similar to chaotic switching between synchronized and 
desynchronized states as observed in several biological neural systems. 

There are varieties of desynchronized states including local metachronal waves 
which are a weakly unstable form of partial synchronization. It is worth inves- 
tigating whether or not the synchronized state can be described by a Milnor 
attractor. Related to this, it will also be worth studying how metachronal waves 
can act to destabilize the synchronized state. Although this neural system has 
a symmetry, it does not seem to be possible to produce a riddled basin. There- 
fore, another scenario must be considered for the presence of Cl in this system. 
Further studies are necessary to clarify the mechanism of the chaotic change of 
synchronization. 
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Abstract. We here review the second lecture, in which we show that 
Cantor coding can be used to hierarchically embed temporal sequences 
produced by a chaotic or a random information source. We also propose 
a hypothesis on the possibility of Cantor coding for the formation of 
episodic memory in the hippocampus and also for the category formation 
of episodic memory. Smale’s horseshoe map defines Cantor coding as a 
conjugacy of expanding dynamics embedded within the map. On the 
other hand, in the real world, this type of conjugacy does not always exist. 
We demonstrate the use of Cantor coding in a mathematical model of the 
hippocampus. The computational results suggest a dynamic mechanism 
for the formation of episodic memory and category formation of episodic 
memory. 



1 The Background of Lecture II 

The aim of the present lecture is to show how information of the time series 
generated by chaotic dynamical system is encoded in Cantor sets generated in a 
contracting subspace of chaos-driven contracting systems, and also to apply this 
idea to the construction of a mathematical model of episodic memory formation. 

Episodic memory is defined by Tulving, as memory concerning the informa- 
tion of individual experiences [1], It should be noted that an individual experience 
is not a series of events that one actually experiences in daily life, but, rather, can 
be identified with the reorganized structure of neural activity created internally, 
associated with the input information during such events. Furthermore, clinical 
reports on the subjects H. M. [2] and R. B. [3] have shown that the hippocampus, 
especially CA1, is indispensable for the formation of episodic memory. 

With respect to mathematical modeling, it is important to note that the 
structure of CA3 is very similar to that of the neural network model of associative 
memory [4, 5] . The model studies since the work of Marr [6] are based on the 
idea that the hippocampus temporarily retains episodic memory as an associative 
memory, as reported by Treves and Rolls [7], and by McClelland et al. [8]. On the 
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other hand, conventional neural network model of associative memory possess 
attractor dynamics only, and thus they cannot create temporal patterns that 
could represent episodic memory. Therefore, to prove the theory proposed by 
Marr, Treves and Rolls, McClelland and others, an additional mechanism to 
produce a temporal series of patterns is required. 

The structural similarity of neocortical networks to the CA3 network is obvi- 
ous. As demonstrated in lecture I, the recurrent networks of excitatory neurons 
with inhibitory interneurons forming a common architecture of neocortical sub- 
areas exhibit a temporal series of memory patterns that show chaotic itinerancy. 
Because CA3 mainly consists of excitatory recurrent networks and inhibitory 
interneurons, the activity of CA3 is expected not to generate stationary spatial 
patterns, but to produce a temporal series of spatial patterns. 

The structure of CA1, on the other hand, lacks the recurrent connections of 
the pyramidal cells, thus the CA1 network may be stable as long as there are no 
synaptic modifications or inputs. Thus, it is possible to model the hippocampus 
as a chaos-driven contracting system. 

It may also be worth considering that this mathematical model provides the 
dynamic characteristics for a reorganized hippocampal network driven by the 
other parts of the brain. 

Let us briefly mention the related studies on Cantor coding. Coding on Cantor 
sets was first demonstrated in an iterated function system (IFS) [9-12] and later 
in a recurrent neural network (RNN) [13-15]. IFS was proposed to realize the 
coding of spatial patterns as fractals on Cantor sets. In particular, Karlin and 
Norman proposed a deterministic model of animal’s stochastic behavior during 
reinforcement learning. A stochastic renewal of plural contacting systems led 
to the generation of Cantor sets. RNN was proposed to realize the Cantor set 
coding of temporal patterns, which were produced as a random series by the 
network. 

Our problem was how to encode the information in the symbol sequences 
generated in a chaotic dynamical system into the system’s contracting subspace. 
We investigated this issue in several chaos-driven contracting systems, where var- 
ious Cantor sets appeared to encode various temporal patterns that the chaotic 
system produced [16-18] . 



2 A Computational Model for the Hippocampus 

The model we propose is a computational one at the macroscopic network level. 
In other words, the concern here is with a basic network model exhibiting and 
encoding a temporal series of patterns. Thus, we do not deal with the details 
of the characteristics of single neurons or even with specificity of the layered 
structure of the network, but rather we try to model the basic network function 
related to the production and encoding of episodic memory. To construct such a 
model for the hippocampus, we first describe the physiological and anatomical 
facts at the network level. This description provides the basic architecture for 
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the model, focusing on the network structure of the CA1 and CA3 regions and 
their interactions. 

The detailed structure of hippocampus is quite different from the neocortex. 
Nevertheless, we would like to focus on the similarities between the two for the 
following reason. As mentioned in lecture I, the network consisting of the recur- 
rent network of pyramidal cells and the inhibitory interneurons, which usually 
act as destabilization by inhibitory cells , produces a chaotic series of memory pat- 
terns. Finding a similar network in the hippocampus is the first step in modeling 
the formation and encoding of episodic memory. 

In CA3, the axon-collaterals of pyramidal cells make synaptic contact with 
other pyramidal cells, and thus form a recurrent network. Furthermore, each 
pyramidal cell makes an excitatory synaptic contact with a neighboring in- 
hibitory cell. Such a cell inhibits the neighboring pyramidal cells for at least 200 
ms in all the physiological conditions studied so far. The other axon-collaterals 
of pyramidal cells, the Schaeffer collaterals, make synaptic contact with the api- 
cal dendrites of the pyramidal cells in CA1. Thus these connections establish a 
unidirectional coupling from the CA3 network to the CA1 network. Mathemat- 
ically, this can be called a skew-product transformation of the dynamics in CA3 
and CA1. 

In CA1, on the other hand, recurrent connections between pyramidal cells 
have not been observed. Inhibitory cells also exist in CA1, each of which mainly 
receives the output of the nearest neighbor pyramidal cell and each in turn in- 
hibits its neighboring pyramidal cells. The axon-collaterals of the CA1 pyramidal 
cells project to various other areas such as the subiculum, the entorhinal cortex, 
and other subcortical areas. 

The inhibitory cells that project on CA3 and CA1 from other areas have rel- 
atively long axon-collaterals that form an interneuronal network [19,20]. Both 
cholinergic and GABAergic afferents project from the septum to hippocampal 
CA3 and CA1. The cholinergic afferents contact both the pyramidal and in- 
hibitory cells in the hippocampus. On the other hand, the GABAergic afferents 
make synaptic contacts mainly with inhibitory cells [21-23]. In fact, Toth et 
al. concluded from their experiments that the GABAergic septo-hippocampal 
afferents selectively inhibit the hippocampal inhibitory cells and consequently 
disinlribit the pyramidal cells. 

Taking into account these anatomical and physiological facts, and also some 
additional hypotheses, the network model of the hippocampus was constructed 
[24]. 

One of the assumptions is that time is discrete that is not necessarily made in 
conventional model of neurodynamics. The reason that we made this assumption 
stems from both theories of cortical information processing based on rhythmic 
cortical activity and experimental facts describing the relation between 6 and 
7 rhythms (see, for example, [25-28]). If 6 rhythms are incorporated in the 
formation of episodic memory, each cycle requires approximately 200 ms for the 
transmission of the episodic signal from the hippocampus back to itself through 
the neocortical areas and the entorhinal cortex [19] . This suggests that a sequence 
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of episodic events is generated during one cycle of the 9 oscillations. On the other 
hand, the 7 rhythms may be related to conscious experience, which is known to 
activate hippocampal neurons. If so, it is possible that the 7 rhythms “filter” 
and discretize continuous signals accompanied by the 6 oscillations. Thus, a 
few (4-8) discrete wave packets having the 7 range frequency may represent a 
sequence of events during one cycle of this signal transmission. Lisman et al [27, 
28] attempted to relate the appearance of such 7 wave packets to the magic 
number 7 ± 2 [29] for short-term memory. As mentioned in lecture I, this magic 
number has also been discussed in relation to chaotic dynamical systems [30, 31]. 
The neural mechanism for the magic number is still an open question, because 
the magic number operates not only in remembering a sequence of numbers over 
a short period of time, but also for inferring the logical depth of a sentence 
that hierarchically contains other sentences, such as “A heard that B told C 
that D loves E”, and also for classifying events and experiences into a number 
of categories. Although the neural mechanism for the magic number may be of 
interest, analysis of it is not our present purpose and we will not consider it 
further. 

Let X and Y be TV-dimensional vectors, with their ith components denoted 
by Xi and respectively, representing the activity of the ith pyramidal and 
inhibitory cells in CA3. For the reasons given above, we introduce a discrete 
time step n for the development of the neurodynamics. The following equation 
provides a 2 N- dimensional dynamical system in CA3 [24] . 

(A(n + 1), Y(n + 1)) = F(X(n),Y(n)), (1) 



where A is a 2iV-dimensional nonlinear transformation denoting the neurody- 
namics. Taking into account the effect of the stimulus-induced stochastic release 
of synaptic vesicles widely observed in the hippocampus, however, we interpret F 
as representing a stochastic renewal of neurodynamics , as in the case of Tsuda’s 
model of successive association of memories [32-35] . 

The stochastic renewal of neurodynamics for the itli set of pyramidal and 
inhibitory cells is defined as follows. 



f H x (jf J2f=i w ij x j ~ diVi) (with probability p x ) 
\ Xi (n) (otherwise) 

f Hy{jf J2jLi e ij x i) (with probability p y ) 

\ 0 (otherwise), 



(2) 

(3) 



where a \ G [— l,l](i = 1 , - • • , AT) , y, G [— l,l](i = 1,---,TV), denotes a 
synaptic weight from pyramidal cell j to i, di € [0, (3\ a synaptic weight from the 
inhibitory cell i to the pyramidal cell i, and G [0, a] a synaptic weight from the 
pyramidal cell j to the inhibitory cell i. Furthermore, the neural transformation 
is given by the sigmoid function. 



H x (z) = H y (z ) = 2/(1 + exp(-X 3 z)) - 1. (4) 



Here A3 is a steepness parameter. 
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Semantic memories encoded in the parahippocampal area are here repre- 
sented by a vector, each component of which denotes the activity of pyramidal 
cell. The memories are assumed to be constructed in the recurrent network of 
pyramidal cells by the Hebbian learning algorithm; 

w U = jf'52 x i x j> ( 5 ) 

where x f is the ith component of the /itli memory, X M = (x±, ■ ■ ■ , x^). 

We hrst describe what could happen in the CA3 network in the case without 
the effects of the inhibitory neurons. This condition of the network, denoted 
by di = 0 for all i, can occur under the physiological conditions such that the 
inhibitory neurons in CA3 are inhibited by the GABAergic afferents coming from 
the septum. In this case, as in the conventional model of associative memory, the 
network dynamics become attractor dynamics. Some attractors are generated to 
represent semantic memories by the above Hebbian learning algorithm. 

We next describe what could happen in the CA3 network in the case without 
the effects of GABAergic neurons from the septum, namely, with the effects of 
the inhibitory neurons of CA3 alone. This situation is similar to the one in the 
dynamic associative memory model [32-38] . As expected, the chaotic transitions 
between memories occur through chaotic itinerancy [39, 40, 33] between attractor 
ruins representing the memory traces (see also [41]). The chaotic sequence of 
memory patterns generated in such a way may be considered a representation 
of episodic memory. 

Let us further describe our model for the skeleton of the CA1 network and 
its dynamics. 

Let U and V be M-dimensional vectors of the space [0, 1] M , U = (tii, ■ ■ • , um ) 
and V = (vi, ■■■,%), where it, and i \ denote the states of pyramidal cell i and 
stellate cell i, respectively. Here, M is the number of cells of each type. 

i ion! ■ (ti') — l - 

Ui(n+ 1) = r b 3 2 + 6»), (6) 

3 3 

v i ( n ) = bij Uj (n) , (7) 

3 

x' j {n)=Xj(n ) (xi(n) > 0), (8) 

x'j(ri) = —Xj(n) (otherwise) (9) 

Here, Ty denotes the synaptic connection from the CA3 pyramidal cell j to the 
CA1 pyramidal cell i, and its value is fixed to a randomly chosen value from 
the uniform distribution on [0, 1]. In the simulation [24], we set bu = 1.0 and 
b-ij = 0 if i ^ j, and cu = 1.0 and Cy = 0 if i ^ j. It would be worth studying 
network performance with different arrangements of connections, but here we 
review only the results in this restricted case. Because the Shaeffer collaterals 
of the CA3 pyramidal cells are excitatory, we introduced the terms xt . We also 
introduced the control term <5 because GABAergic inhibitory neurons from the 
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septum make synaptic contact with the stellate cells in CA1. We assumed 6 << 1 
in the presence of GABAergic inhibitory inputs from the septum, and S = 0(1) 
in the absence of such disinhibitions. 

The input-output transformation for the pyramidal cells is also the sigmoid 
function. 

ff u (z) = 1/(1 + exp(-Aiz)), (10) 

where Ai is a steepness parameter. 

3 Does Cantor Coding Provide a Biologically Plausible 
Representation of Information in the Hippocampus? 

In the simulation of the CA1 model, we chose the parameter values that satisfied 
the inequality that establishes the contraction condition. We thus obtained a 
chaotic dynamics-driven contracting system as a computational model for the 
CA3-CA1 unidirectional couplings at the network level. We conducted computer 
experiments step by step in three separated stages, in each of which the inputs 
to CA1 pyramidal cells differed. 

(1) Inputs of the random sequence of two temporal patterns, ‘100’ and ‘10’ 

The symbol T’ indicates the presence of a pulse and ‘0’ the absence of a pulse, 
so the random input sequence consists of two different pulse sequences. The ran- 
domness of the sequence was provided by a coin tossing algorithm. The computa- 
tion results showed a hierarchical coding of such random temporal sequences on 
the Cantor set. In any two-dimensional space projected from the iV-dimensional 
space of the membrane potential of pyramidal cells in CA1, say the space of 
ui — U 2 , a nonoverlapping Cantor set is clearly seen (see Fig. 1). In Fig. 1, 
whenever the pulse denoted by ‘1’ appears in input, the activity of iq and tq 
is plotted at the next time step. The plotted dots constitute a self-similar set, 
called a Cantor set, in the following way. The activity shown by the two subsets 
on the right upper part and on the left lower part of the figure indicate that the 
pulse sequences including the present pulse were ‘10*’ and TOO*’, respectively. 
Here, the symbol “*’ denotes arbitrary random sequences of TOO’ and TO’, and 
the right hand side in the sequence indicates the past. The upper two subsets 
indicate that the sequences including the present pulse were ‘1010*’ in the subse- 
quent upper subset and ‘10100*’ in the subsequent lower subset, respectively. In 
a similar way, the left lower two subsets indicate ‘100100*’ and ‘10010*’, respec- 
tively. This shows a hierarchical representation of the embedding of the random 
temporal series consisting of TOO’ and TO’. 

(2) Inputs of random sequences of continuous spatial patterns 

We define a continuous spatial pattern by a vector, with each component repre- 
senting the activity of each pyramidal cell in CA3 that is sent to all pyramidal 
cells in CA1, via Slraeffer collaterals of the CA3 pyramidal cells. The activity 
is represented by a real number in the unit interval, [0, 1]. A random sequence 
consisting of different patterns acted as the temporal input to CA1, and the 
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Fig. 1. A Cantor coding of random temporal series consisting of two pulse sequences, 
‘10’ and ‘100’. N = 64, Ai = 50, e = 0.032, S = 0.06, 0 = 0, fcy = 1 (i = j) and b {j = 0 
(otherwise), and Qj = 1 {i — j) and Cij = 0 (otherwise). 



- 


10 * 


,* 

s 


1010 * 


- 


** 

** 


10100 * 


- 


100 * 


* 100100 * 

9 






# 


10010 * 






0.4 


0.45 0.5 


0.55 


0.6 



iti 



sequence is hierarchically embedded in the IV-dimensional space of pyramidal 
cell activity (see reference [24] for a concrete example of the embedding). 

(3) Inputs of the temporal output series from the pyramidal cells in CA3 
This case corresponds to the actual physiological situation. Our model CA3 pro- 
duces chaotic itinerancy of memory patterns, where a temporal series of memo- 
ries represents an episode. Although the Cantor set is manifested in CA1 phase 
space, overlaps between subsets are generally observed. It is usually difficult to 
separate such overlapped subsets, so their appearance creates difficulties in de- 
coding the embedded temporal patterns. Thus, the overlapping must be resolved 
so that the proposed algorithm will apply to the hippocampus. This important 
study will be published elsewhere. 

In all of the three cases, the history of the temporal sequence is represented 
by the hierarchy of Cantor sets in phase space, hence the name Cantor coding. 

4 Summary and Discussion 

We proposed a computational model for the formation of episodic memory. The 
model was constructed to represent the basic network of the hippocampal CA3 
and CA1, and their interactions. The main characteristic of CA3 dynamics is the 
generation of a temporal series that can be described as chaotic itinerancy [41]. 
It was shown that a certain kind of chaotic behavior provides a dynamic informa- 
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tion channel, and hence such chaotic behavior should also be expected to work 
computationally in the brain [36,41]. On the other hand, the main characteris- 
tic of CA1 dynamics is a contaracting dynamics if it is isolated so that there is 
less bombardment from CA3 changing the strength of connections. Thus, overall 
activity should follow chaos-driven contracting dynamics. 

Generally, in chaos-driven contracting dynamics, the information read out by 
chaotic dynamics is written in the contracting space by the contracting dynamics, 
as is seen in the dissipative baker’s transformation. In other words, a symbol 
sequence created by chaotic dynamics is represented as a subset of a Cantor 
set generated in the contracting subspace. A code table can thus be dynamically 
formed on a Cantor set that is generated in the phase space in CA1 pyramidal 
cells. This is what we would expect in CA1 with weak or no synaptic learning: 
the demonstration of a one-to-one correspondence between each symbol sequence 
generated by chaotic dynamics in CA3 and each position of a Cantor element in 
CA1, by virtue of unidirectional coupling from CA3 to CA1, i.e., CA3 — » CA1. 

The distance between (or the closeness of) semantic memories (or events ex- 
perienced) represented by a spatial pattern of neuronal activity could be specified 
in CA3 by a scalar product between all the two patterns. On the other hand, 
the distance between the different temporal series may be defined in CA1 by the 
Euclidian norm between the corresponding two points in the Cantor set. In fact, 
it can be defined through the hierarchies of the Cantor set. Thus, the concept 
of distance between episodic memories could be realized in CA1. Therefore, the 
category formation for a variety of episodes may be expected to be realized in 
CA1 by the clustering of subsets of the Cantor set. This clustering will effec- 
tively be established via synaptic learning, i.e., LTP (see, for example, [42]). For 
such a categorization process for a temporal series of events, a Hausdorff metric 
between Cantor subsets can be used. 

Our hypothesis on Cantor coding is depicted in Fig. 2. 

In the formation of episodic memory, the relationship between the temporal 
series of spatial patterns in CA3 and the geometry with a Hausdorff metric of the 
Cantor set in CA1 may be flexibly altered, whereas in the cortices the alteration 
of the representation by structural changes will vary slowly. In this respect, 
the hippocampus may be likened to a blackboard. Writing and erasing on this 
hippocampus “blackboard” could be related to the 9 -rhythms, thus taking about 
200 ms. The timing between this writing and erasing and the slowly varying 
transition between symbols in the cortex is a key to the formation of episodic 
memory. This illustrates the necessity of a long period of time, from perhaps, a 
few years to a lifetime, for the complete formation of episodic memory. This can 
be understood by taking account of the existence of retrograde amnesia for one 
to three years as well as anterograde amnesia after hippocampal deprivation [2], 
and also after CA1 lesions [3]. 

Whether the Cantor coding actually occurs in CA1 can be examined in the 
laboratory experiments that must initially be conducted under a nonlearning 
condition, using a blocker for LTP, followed by experiments to observe alteration 
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Hypothesis 




Fig. 2. Hypothesis on Cantor coding. See the text for details. 



of the set structure in the absence of such a blocker. The three stages illustrated 
above will correspond to an actual procedure for the stages of experiments. 
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Abstract. Although it is generally recognized that “interneurons gen- 
erate a variety of synchronous inhibitory rhythms in the neocortex. . . ” 
(J. R. Gibson et al. ) and they “may play a key role in coordinating cor- 
tical activity. . .” (M. Galarreta & S. Hestrin), little is known how they 
behave in the in vivo neocortex. A salient property of some interneu- 
ron systems in the neocortex is that they are coupled by gap junctions 
(GJs) - a kind of electrical couplings very intensively between the same 
type of interneurons. In our previous studies, we reported the theoretical 
possibility that a class of neuron systems may exhibit spatio-temporal 
chaos when they are coupled by GJs, while the individual neurons, when 
isolated, exhibit only simple repetitive firings. This dynamics is emer- 
gent, and unveils only when cells are coupled by GJs. Mathematically, 
this phenomenon could be an expression of chaotic itinerancy among 
pseudo-attractors (or, attractor ruins). In view of the ubiquity of GJs - 
there are at least five distinct interneuron systems coupled by GJs in the 
six layers of the neocortex, and in view of the significance of the concept 
of chaotic itinerancy in memory dynamics we give in this lecture a review 
about general property and collective dynamics of GJ-coupled neuronal 
systems. 



1 Introduction 

1.1 Background 

Recent physiological data recognizes the massive presence of gap junctions (GJs) 
among interneuorns in the neocortex, which poses serious questions about the 
organization of neocortex, the possible role of interneurons in neocortical rhythm 
generations and the role of interneurons in cognitive functions. In our previous 
studies (Nakano et al., 2003 [21]; Fujii and Tsuda, 2004 [8]), we reported the the- 
oretical possibility that a class of neuron systems may exhibit spatio-temporal 
chaos when they are coupled by GJs, while the individual neurons, when iso- 
lated, exhibit only simple repetitive firings under injected background currents. 
This dynamics is emergent, and unveils only when cells are coupled by GJs. 
Mathematically, this phenomenon could be an expression of chaotic itinerancy 
among pseudo-attractors (or, attractor ruins), which could be characterized as 
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transiently synchronized states. In view of the ubiquity of GJs - there are at 
least five distinct interneuron systems coupled by GJs in the six layers of the 
neocortex, and in view of the significance of the concept of chaotic itinerancy 
and transient synchrony in memory dynamics (Tsuda and Fujii [32], this issue), 
we give in this lecture a review about general property of collective dynamics 
in GJ-coupled neuronal systems. The emphasis here is on the itinerant behavior 
made by model neuron systems of realistic cortical neurons. 

Gap junctions exist in, as mentioned in the above, at least five distinct in- 
terneuron systems in the six layers of the neocortex. Among FS (fast spiking) 
neurons in the layer 2/3, among FSs and among LTS (low threshold spiking) neu- 
rons in the layer 4, among FSs in the layer 5/6. Recently, a new member joined 
to this GJ-coupled interneuron groups: multipolar bursting (MB) interneurons 
in the layer 2/3. (Blatow et al, 2003 [1].) Gap junctions are not only ubiquitous, 
but massive ([9], [10], [29]). They form distinct networks, and some of them pos- 
sess chemical synaptic connections, too. Although it is generally believed that “a 
(gap junction-coupled) network of FS cells in the neocortex may play a key role 
in coordinating cortical activity...” (M. Galarreta & S. Hestrin, 1999 [9]), or 
“interneurons generate a variety of synchronous inhibitory rhythms in the neo- 
cortex. . . ”(J. R. Gibson et al., 1999 [10]), little is known how those interneurons 
collectively behave when coupled by GJs in a massive way. In view of the fact 
that various distinct interneuron systems are involved in GJ couplings, one may 
be led to ask such general questions as: what kind of ionic channel properties 
or, what kind of nonlinearity in mathematical terms may lead to what kind of 
collective dynamics when they are coupled by GJs? In particular, one may ask 
what kind of neuronal processes may give rise to such an emergent chaotic dy- 
namics? These questions, together with their possible role in cognitive functions, 
have not been well understood from both mathematical and physiological points 
of view. 

We may also envisage possible relations with some dynamical phenomena 
observed in the neocortex, e.g., transient synchrony and fluctuation in local field 
potentials (LFP) ([11]). See, also discussions in Tsuda and Fujii [32]. 

1.2 Gap Junctions in the Neocortex — A Quick Review 

It is already three decades ago that the presence of gap junctions in primates 
neocortex was firstly reported ([28]). Very recently it is generally recognized that 
the GJs are really massive and ubiquitous in the neocortex - but only among 
interneurons. (See, e.g., [1], [9], [10] and [29]). 

1. GJ couplings exist in the layers 2/3, 4, and 5/6. In the layer 2/3, there are 
two distinct systems: FS and MB interneurons. MBs may send chemically- 
induced spikes to FSs, but not vice versa (Blatow et al. [1], 2003). The layer 4 
has two distinct GJ-coupled systems: FS and LTS-systems. The two systems 
may also be coupled by gap junctions each other, but the couplings are not 
so dense ([9], [10], [29]). The layers 5/6 include a GJ-coupled FS interneuron 
system. 
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2. GJ-couplings are neuron type specific within the same type of neurons, with 
an exception: FS-LTS in the layer 4. 

3. Connectivity within a neuron group is not fully understood, but connections 
are massive. For example, (although there are some inconsistency among 
known data), one may say that the GJ couplings are far more massive (or, 
at least not less) than synaptic connections, although functions of the two 
“couplings/connections” as well as connectivity are not equal. See, [9], [10] 
and [29]. Interplay of those two connection systems should play important 
roles in neocortical rhythm genesis. 

In view of its dense GJ-couplings, an interneuron system, if it is GJ- 
coupled, may better be regarded primarily as a GJ-coupled system. Chemi- 
cal connections may play their role on the basis of the intrinsic “rhythms” 
induced by GJs. 

4. Origins of EPSPs (projected on GJ systems): some evidence exits that they 
are fed in a feed forward manner, but details are largely unknown yet, in- 
cluding the possibility that they might receive some top down “signals” from 
other cortical areas etc. Dantzker and Callaway. ([5]) reported that both the 
FS interneurons and pyramidal cells (PYR) in the layer 1 /2 are fed from the 
layer 4 in a parallel way. It appears that FSs do not receive (at least strong) 
EPSPs from PYRs of the same layer, while PYRs are inhibited by these FSs. 
There are other interneuron groups (probably, with no GJ couplings) which 
receive excitatory inputs from PYRs. This gives some idea about the general 
organization of PYR.-FS cell interactions. Whether and how this GJ-coupled 
FS interneuron system receives inputs from other cortical/non-cortical areas 
as “top down” or “control” signals is not known. 

2 Single Cell Behavior of Class I Cells — Some Examples 

2.1 Historical Background 

Hodgkin pointed out in 1948 that there exist two classes of “neurons” (actually, 
“axons”). In his paper of 1948 [15] (before he published the famous paper de- 
scribing the Hodgkin-Huxley (HH) equations) he wrote that class I “axons are 
capable of repetition over a wide range of frequencies, varied smoothly over a 
range of about 5-150 impulses per sec.”, and class II “axons usually give a train 
of impulses of frequency 75-150 /sec which was relatively insensitive to changes 
in the strength of the applied current” . 

Rogawski commented in his review ([25], 1985) that the difference of the dy- 
namic range property, that is, of spike firing characteristic, originates from the 
presence of I a -current, a transient slowly inactivating potassium current. I a 
is an outward potassium current, differs from the rectifying K + current Ik, in 
several important aspects, i.e., in its kinetics, in that it activates at more hy- 
perpolarized level of membrane potential, and blocked by 4-AP, less sensitive to 
TEA (a blocker of Ik). I a does not exist in squid axons on which the HH model 
is based. He stressed that this fact provides the basis of the class I characteristic 
of the most of cortical neurons. 
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It was Connor and his coworkers ([4], 1971; [3], 1977) who studied the “class 
I” neurons with Ia channel from model-theoretic standpoints. They derived 
Hodgkin-Huxley type equations with six variables, two among them coming from 
additional activation/inactivation states of the / 4 -current. Based on the Con- 
nor’s work, Rose and Hindmarslr ([26], 1989) reduced the Connor equations into 
two-dimensional equations with the spirit to approximate the six variable equa- 
tions by a simpler system, and still keeping explicit relations to the original ionic 
currents. Similar attempts have been made to the original HH equations by, e.g., 
Rinzel ([23]) and others. Wilson ([33a,b], 1999) presented two- variable equations 
similar to the Rose-Hindmarsh equations to qualitatively approximate “essential 
dynamics” of neocortical class I and II neurons. 

The argument of Rogawski ([25]) may be controversial in a few aspects. After 
his review appeared, it become recognized that there are at least two /^-currents: 
I As (or, Id) IaJi with difference in kinetics, activation/inactivation level and 
sensitivity to 4-AP. Rogawski’s arguments may have been based on these two 
currents in a mixed-up way. Next, the class I property does not necessarily comes 
from / 4 -currents. For example, the Morris-Lecar model, which was designed to 
“model” the crustacean neurons, with only Ca ++ and K + channels included, 
may have the class I property. We note, however, that it is generally recognized 
that most of cortical neurons are of class I (see, e.g., Cauli, [2]) in the sense of 
Hodgkin. But the ionic currents responsible to their class I characteristic appear 
to be not fully understood. 

The concept of class I/II of neurons may be better understandable in a 
mathematical context. Namely, the difference of dynamic behavior between these 
two classes is attributed to that of generation mechanism of action potentials: 
homoclinic bifurcations (class I), and subcritical Hopf bifurcations (class II). In 
this respect, we note two points. Firstly, the homoclinic bifurcations do not 
mean a unique concept if viewed from a dynamical systems standpoint. As we 
shall see later, one may observe saddle-node on a limit cycle bifurcations for 
the Connor neuron family, and saddle separatrix loop bifurcations typically seen 
in the Morris-Lecar neurons in their class I regime. (See, also, Izhikeviclr [17].) 
This taxonomy, identified by the orbits in the phase plane and the bifurcation 
diagram, appears subtle, but it will affect the nature of collective behavior when 
coupled by GJs. The second note is that the HH model itself is of class II in most 
of biologically appropriate parameter values. It is reasonable to assume that a 
Hopf singularity (i.e., bifurcation point) is built-in in the HH equations, and in 
fact it can be observed numerically. 

Although the concept of “class” of neurons and that of bifurcations as well 
do not depend on the number of variables, we proceed our arguments based 
primarily on a reduced form of two- variables, where the first variable, say V, 
may represent the membrane potential, and the second one, R , the “recovery” 
variable representing an activation state of, e.g., some potassium channels in a 
generalized sense [23], [26]. With the injected current strength being denoted by 
i, the single cell equation may be written as: 
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Tv ^ F = Av, R ) + i 

r R § = 9 (V,R) 



(1) 



All the ion channel characteristics are “condensed” in the nonlinearity of the 
functions of / and g here - but only in an approximate sense. The correspondence 
between the original ionic channel-based equations of HH-type and (1) is not 
one-to-one, but many-to-one. Note also that if one considers to describe a more 
general dynamic behavior of single cells like “bursting”, “adaptation”, etc. which 
are observed in pyramidal cells, or regular spiking interneurons and so on, one 
need to add at least another one variable to represent “slow” dynamics. However, 
in this lecture we restrict our discussions to equations of the type (1), and focus 
on the collective behavior of such cells coupled by GJs, rather than detailed 
dynamics of single isolated cells. 



2.2 Class I and II Neurons in Two- Variable Models 

Let us begin with the two- variable model of the form (1). We denote by A/y and 
A fn the nullclines of the V and R equations: A/y = { (V,R) \ f(V,R)+i= 0 } 
and A fn = {(V,R) \ g(V,R)= 0 }, respectively. A/y and A fn correspond to the 

sets of points in (V,R)~ space where — — = 0 and — — = 0 hold, respectively. 

LLL UjL 

Within the framework of two- variable reduced model of the form (1), A/y takes 
a cubic-like form in most cases. On the other hand, the form of A fn characterizes 
the neuron property. As we shall give particular examples, the class I neurons 
are characterized by quadratic-like A/k’s, and the class II neurons by essentially 
linear A/k’s. 

The FitzHugh-Nagumo (FHN) model ([7], [20]) is one of the most known 
class II “neurons”, which has an inclined I-shaped A fn- Since A/y is essentially 
cubic, the point of intersections of A/y and A fn shifts according to the level of 
the injected current i, taken as a bifurcation parameter. For class II neurons, the 
intersection of the two nullclines is (in most cases) unique, and the nature of 
the intersection changes from a stable node (two real negative eigenvalues) to 
an unstable spiral, (i.e., a pair of complex conjugate eigenvalues with positive 
real parts). Between those two states, there appears a pair of pure imaginary 
eigenvalues which correspond to a Hopf bifurcation point. 

For the convenience of later discussions, we introduce a few models of class 
I and II in explicit forms in the following. 



Example 1. Morris-Lecar Model — Regime I and II 

Although the Morris-Lecar equation was firstly introduced to model crustacean 
neurons with Ca ++ and K + channels, it has frequently been used as a theoretical 
model. (Rinzel-Ermentrout [24], Izhikeviclr [17].) This ML model can be both of 
class I and II depending on its parameters. 
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-tt = ~9Ca.m 00 (V)(V - 1) + g K R(V - V K ) + g L (V - V L ) + i 
at 



— = q[Roo(V) - R]/t r {V) 

where 

moofy) = [1 + tanh{(V - V a )/V b }\/2 
Roo(V) = [1 + tanh{{V - V c )/V d }]/2 (2) 

t r (V) = l/cosh{(F - V c )/{2V d )} 



V„ = -0.01, V b = 0.15 ,g Ca = 1.0 ,g K = 2.0 
9l = 0.5, = -0.7, V L = -0.5, q = 1.15 





Fig. 1. Two regimes of the Morris-Lecar model. Two nullclines are drawn in the phase 
( V, R )-space. {Left) Class I regime: V c = 0.1, V d = 0.145, q = 1/3; i = 0.083 {Right) 
Class II regime: V c — 0, V'/ - 0.3, q=0.2; i = 0.2;0.3;0.4. 



Fig. 2 shows the orbit in the phase space in the case of Class I regime at the 
saddle separatrix loop bifurcation where the unstable and stable manifolds make 
a closed loop, i.e., a homoclinic orbit. As the current-level i increases, this loop 
splits from the saddle. Since the new born loops are close enough to the saddle, 
their period is very large just after they split from the saddle. This is the class 
I formation mechanism of spikes in the Morris-Lecar I case. Although this may 
provide an example of class I characteristic in Hodgkin’s sense, those “spikes” 
are actually small membrane potential oscillations in a depolarized state. 



Example 2. A Model of Connor-Rose-Hindmarsh (CRH) Family 

The Rose and Hindmarsh model based on the Connor equations is also writ- 
ten in the form of (1), where Mu takes a quadratic-like U-shape. Let us consider 
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Fig. 2. Phase diagram of the Morris-Lecar model in Class I regime. The diagrams 
illustrate the birth of a saddle separatrix loop bifurcation. The left diagram shows the 
orbits when the current i is below the bifurcation point. As i increases, the unstable 
and the stable manifolds from the saddle (the cross mark) form a closed loop, i.e. , a 
homoclinic orbit (center). Then, the loop splits from the saddle and forms a limit cycle 
( right ). 
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Fig. 3. Bifurcation diagram of the Morris-Lecar model in Class I regime. At the critical 
current value i=i*, there appears a homoclinic orbit. Filled circles indicate bifurcated 
stable branch of “spikes”, while the open circles are an unstable branch of spikes. 



(1) with the nonlinearity in the form of (3), following the formulation of Rinzel 
[23] and Wilson (1999, [33a]): 

/ /(V, R) = -m 00 (y)(F - V Na ) - g K R(V - V K ) , ^ 

\g(V,R) = -R + R 00 (V) 

Nakano et al. ([21], 2003) considered a class I model of the form (1), (3), where 



moo (V) = 5.36 + 17.04V + 16.9V 2 , itoo(V) = 1.29V + 0.53 + 3.3(V + 0.18 ) 2 ,g K = 11.0 t 
V Na = 0.48, Vk = -0.95, t v = 1.0, t r = 2.4 W 



which is a modified version of Wilson’s Class I model (1999, [33a]). Eqs.(l), (3) 
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“narrow channel” 
saddle-node 

stable node (equilibrium) 
saddle 

unstable spiral 



phase plane 



bifurcation diagram 



Fig. 4. Phase diagram of the Connor-Rose-Hindmarsh model. As the injected current 
i increases from the rest state, the nullclines become to have three intersections, i.e. , 
a stable node (an equilibrium), a saddle and an unstable spiral. Then, the stable node 
and the saddle collide at a critical current value, where a homoclinic orbit appears from 
the saddle-node. See, also Fig. 5. This is the seed of action potentials with the period 
= oo. Note that after the current % traverses the saddle-node bifurcation point, there 
appears a “narrow channel” between the two nullclines. 



(4) have an unstable spiral instead of the unstable node in the Wilson model. 
See, Fig. 4. 

The number of intersections of A f\> and Mu varies from one to three, and 
then again back to one for such neurons when the current level i increases from 
the rest state. There appears a saddle-node bifurcation point at the critical level 
of i where the saddle and the node collide as is shown in Fig. 4 when i = 
0.07. The salient property of the reduced Connor family is that as the injected 
current i traverses the saddle-node bifurcation point, there appears a narrow 
channel between M\> and Mu- See, the top picture of Fig. 4, at * = 0.18. Rose 
and Hindmarsh [26] emphasized that this is a consequence of the presence of 
/^-current. Fig. 5 illustrates how a saddle-node bifurcation yield a homoclinic 
orbit in the phase plane. 
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Fig. 5. Formation of limit cycles at a saddle-node bifurcation point for the Connor- 
Rose-Hindmarsh neuron, (left) There are two heteroclinic orbits from the saddle to the 
node (left), one of which becomes a large homoclinic orbit when the saddle and the 
node collide (center). Then, as the saddle-node disappears, the remaining closed loop 
turns into a limit cycle (right). 

For the purpose of comparison, Nakano et al. ([21]) also considered a class II 
neuron model by Wilson (1999, [33a]) with a linear nullcline M-r,- 

'mooOO = 17.81 + 47.71G + 32.63F 2 , 

Roo(V) = 1.35F+1.03, f) 

g K = 26.0, Vno. = 0.55, 1 j 

Vk = — 0.92, Ty = 0.8, Tr = 1.9 

Fig. 6 shows instantaneous spike rates as functions of injected current 
strengths for the class I (Eqs. (1), (3) and (4)) and class II (Eq. (5)) models. 
The lower box indicates the class I firings as the current increases. 

3 Class I vs. Class II Neurons under GJ Couplings — 

A Zoology 

In this section we give a zoology of spatio-temporal dynamics emerged by GJ- 
coupled system of neurons of class I and II. The aim here is to understand 
the variety of potential dynamics intrinsic to class I and II neuronal systems 
that may emerge when coupled by GJs. With this in mind, we examine the GJ- 
coupled dynamics under a simple situation: under uniform GJ-couplings without 
synaptic connections, i.e. , with no internal irregularity of the network structure 
and with no external fluctuation of injected currents. This may help us to know 
the nature of intrinsic coupled dynamics: the difference between cells of class I 
and II, between two prototypes of the class I neurons etc.. 

For instance, Nakano et al. [21] have shown that the modified Connor-Rose- 
Hindmarslr system, i.e., Eqs. (1), (3) and (4) with nearest neighbor couplings 
exhibits extensive spatio-temporal chaos under a spatially and temporally con- 
stant injected current which puts the system near the saddle-node critical level. 
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Fig. 6. Spike frequency characteristics of class I and II neurons (top) and the class I 
spike generation of a class I model due to Eqs. (1), (3) and (4) (bottom). 

As is shown in the previous section (Fig. 6), those neurons show only repetitive 
firings when isolated. Han et al. [12a, b] reported that the Morris-Lecar model in 
the class I regime exhibits dephasing of oscillations under all-to-all couplings. 

In the actual cortex GJ couplings are obviously not uniform, and the spatial 
connectivity is not fully understood. There are also chemical synapses among 
interneurons which may exert inhibitory effects on the network dynamics. Po- 
tential and emergent dynamics intrinsic to GJ-coupled systems we introduce 
here may provide a basis for the study of interplay of the GJ-induced dynam- 
ics with those synaptic interactions, and interneuron-PYR neuron interactions. 
Note that the FS-PYR structural configuration mentioned in Sect. 2.1 (Dantzker 
and Callaway [5]) may justify our strategy to study as a first step the intrinsic 
dynamics of interneurons. 

We first summarize the results of numerical works done by Nakano et al. [21] 
in the following. Then, we discuss the collective dynamics of ML systems. 

3.1 Gap Junction Couplings 

Gap junctions are assumed to take the following form according to Schweiglrofer 
et al. [27]. 
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For j 



~ + Ij + 

T n^r = a( v F R j) 



( 6 ) 



where Jj’s stand for injected background currents (which are assumed to be 
constant in time and space in this simulation), while Jj’s represent the currents 
induced by gap junction couplings: 



Jj = g GJ y ^( Vnbj — Vj),(n,bj £ coupled neigbor cells) 

nbj 



( 7 ) 







gap junction coupling g GJ 



© 



one-compartm eat tivo-v (triable 
reduced neuron model 



u 



injected current spatially uniform, 
and temporally constant 



Fig. 7. The configurartion of GJ-couplings considered. Neurons are arranged as a two 
(or, one) dimensional array. GJ-couplings are either 4 or 8 (or, 2) nearest neighbor- 
couplings. Coupling conductances <?gj’s are assumed to be constant. 

Here, g GJ is a coupling constant, which is assumed to be identical for all con- 
nections in the present simulation. When the system is two (one)-dimensionally 
lined up with four (two) neighbors coupling, this is the nearest neighboring cou- 
pling equivalent to a linear discrete diffusion process. For cells at the boundaries, 
the couplings are only with the inner cells according to the definition by Eq.7. 
(Usually, this is referred to as the Neumann boundary condition.) 

A note about the coupling term Jj’s. Sclreweiglrofer et al. ([27]) formulated 
the GJ-coupling term with a mild nonlinearity. However, since we found no 
qualitative difference at least from collective dynamics standpoints, we simply 
assume the linear discrete GJ effects (7) throughout this lecture. 

3.2 Chaos in Two Class I Systems 

a. Nakano et al. Model — A Member of the Connor-Rose-Hindmarsh Family 

The model defined by Eqs. (1), (3) and (4) of class I neuron of the Connor-Rose- 
Hindmarsh family exhibits extensive spatio-temporal chaos with, even spatially 
and temporally constant, injected currents near or over the saddle-node critical 
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point. In general, this GJ-coupled classl system has several typical dynami- 
cal patterns: synchronous repetitive brings, periodic metachronal 1 brings, and 
spatio-temporal chaos as well as synchronous resting state, depending on the 
GJ conductance and the background current level. Note that chaos appears very 
robustly with regard to the background current strength, the strength of GJ 
conductance and even the coupling structure. See, Figs. 8, 9 and 12 below. The 
maximal Lyapunov exponents are positive in such cases. Individual cells exhibit 
extensive buctuations both in phase and magnitude. 




Fig. 8. Chaotic dynamics in the phase plane and the membrane potentials of three 
representative neurons among 10x10 GJ-coupled system of Eqs. (1), (3) and (4). 




Fig. 9. The snapshots of GJ-coupled class I neuron systems of Eqs. (1), (3) and (4). 



b. Morris-Lecar Model in Class I Regime 

The Morris-Lecar model has a parameter range of class I regime. Han et al. 
([12a,b]) have observed dephasing of oscillations under all-to-all couplings. In 
our nearest neighbor couplings, we can also observe alternation of phases of 

1 A metachronal wave is a wave produced by successive phase shifts of neighboring 
neurons’ activity like the movement of centipedes or cilium and flagella. 
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small amplitude oscillations at a depolarized potential level. Also the orbit inter- 
mittently itinerates around the saddle equilibrium. Although a snapshot of the 
spatial pattern appears to be similar to the Connor family, the dynamics in the 
phase plane of individual cells seems very different in that the orbit oscillates 
around the limit cycle in a “depolarized” state and never turns back to the rest 
state. See, Fig. 10. 




Fig. 10. Behavior of GJ-coupled Morris-Lecar model in regime I. The saddle separatrix 
bifurcation occurs approximately at i = 0.07293. The class II regime, on the other hand, 
has plane traveling waves {right). 



3.3 Gap Junction-Coupled Class I and Class II Dynamics 

We show some typical spatio-temporal patterns exhibited by class I and II neu- 
rons. The first row of Fig. 11 below shows the chaotic dynamics of 80x80, 4- 
neiglrbor coupled Class I neurons. The next two rows indicate both class II 
model behaviors with the same configuration as the above case. Since Class II 
equations originate as models of action potential propagations in axons, it is 
natural that they exhibit traveling waves or spiral waves when they are coupled 
by GJs in a one- or two dimensional array. This appears in the third row, while 
they exhibit also complicated spatially non-uniform periodic patterns as in the 
second row. Morris-Lecar equations of regime II show plane traveling waves. 
(See, Fig 10.) 
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Fig. 11. Typical spatio-temporal patterns of Class I neurons defined by Eqs. (1), (3) 
and (4) (top), and class II neurons Eqs. (1), (3) and (5) ( center and bottom). 

3.4 Spatial Scales of Chaotic Patterns 

Fig. 12 (a), (b) show snapshots of membrane potential contours made by 80x80 
coupled Connor family neurons Eqs.(l), (3) and (4), with increasing gap junction 
conductance g GJ . This shows how robustly spatio-temporal chaos appears; it 
appears in a wide range of parameters near the saddle-node bifurcation points, 
and even for far larger injection currents if g GJ is smaller. Fig. 12 (a), (b) are the 
cases for 4- and 8-nearest neighbor couplings. Also, the size of spatial patterns 
become larger when the coupling g GJ is set stronger. 
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Fig. 12. Snapshots of contours of membrane potentials of 80x80 class I* neurons, (a) 
(top) 4nearest neighbor couplings (b) (bottom) 8-nearest neighbor couplings The gap 
junction strength g GJ is, from left to right, 0.5, 1.5 and 3. 







